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This  thesis  presents  a  comprehensive  set  of  hierarchically  organized  modular 
analytical  models  developed  for  the  performance  evaluation  of  multiprogrammed 
virtual-memory  time-shared  computer  systems  using  demand  paging.  The  hierarchy 
of  models  contains  a  user  behavior  model,  a  secondary  memory  model,  a  program 
behavior  model,  a  processor  model,  and  a  total  system  model.  This  thesis  is 
particularly  concerned  with  the  last  three  models.  The  program  behavior  model 
developed  in  this  thesis  allows  us  to  estimate  the  frequency  of  paging  expected 
on  a  given  processing  system.  The  processor  model  allows  us  to  evaluate  the 
throughput  of  a  given  multi-processor  multi-memory  processing  system  under 
multiprogramming.  Finally,  the  total  system  model  allows  us  to  derive  the 
response  cime  distribution  of  an  entire  computer  system  under  study. 

Since  all  major  factors  (such  as  various  system  overhead  times  and  idle  times) 
which  may  decrease  a  system's  computational  capacity  available  for  users'  useful 
work  are  explicitly  considered  in  the  analyses  using  the  above  models,  the 
performance  predicted  by  these  analyses  is  very  realistic.  A  comparison  of  the 
performance  of  an  actual  system,  the  Multics  system  of  M.I.T.,  and  the  corre¬ 
sponding  performance  predicted  by  these  analyses  confirms  the  accuracy  of 
performance  prediction  by  these  models.  Then,  these  analyses  are  applied  to 
the  optimization  of  computer  systems  and  to  the  selection  of  the  best  perform¬ 
ing  system  for  a  given  budget.  The  framework  of  performance  evaluation  using 
these  hierarchically  organized  analytical  models  guides  human  intuition  in 
understanding  the  actual  performance  problems  and  provides  us  with  reliable 
answers  to  basic  performance  questions  on  system  throughput  and  response  time. 
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ABSTRACT 

This  thesis  presents  a  comprehensive  set  of  hierarchically 
organized  modular  analytical  models  developed  for  the  performance 
evaluation  of  multiprogrammed  virtual-memory  time-shared  computer 
systems  using  demand  paging.  The  hierarchy  of  models  contains  a  user 
behavior  model,  a  secondary  memory  model,  a  program  behavior  model,  a 
processor  model,  and  a  total  system  model.  This  thesis  is  particularly 
concerned  with  the  last  three  models.  The  program  behavior  model 
developed  in  this  thesis  allows  us  to  estimate  the  frequency  of  paging 
expected  on  a  given  processing  system.  The  processor  model  allows  us  to 
evaluate  the  throughput  of  a  given  multi-processor  multi-memory 
processing  system  under  multiprogramming.  Finally,  the  totaJ  system 
model  allow*  us  to  derive  the  response  time  distribution  of  an  entire 
computer  system  under  study. 

Since  all  major  factors  (such  as  various  system  overhead  times  and 
idle  times)  which  may  decrease  a  system's  computational  capacity 
available  for  users'  useful  work  are  explicitly  considered  in  the 
analyses  using  the  above  models,  the  performance  predicted  by  these 
analyses  is  very  realistic.  A  comparison  of  the  performance  of  an 
actual  system,  the  Multics  system  of  M.I.T.,  and  the  corresponding 
performance  predicted  by  these  analyses  confirms  the  accuracy  of 
performance  prediction  by  these  models.  Then,  these  analyses  are 
applied  to  the  optimization  of  computer  systems  and  to  the  selection  of 
the  best  performing  system  for  a  given  budget.  The  framework  of 
performance  evaluation  using  these  hierarchically  organized  analytical 
models  guides  human  intuition  in  understanding  the  actual  performance 
problems  and  provides  us  with  reliable  answers  to  most  of  the  basic 
quantitative  performance  questions  concerning  throughput  and  response 
time  of  actual  modern  large-scale  time -shared  computer  systems. 
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CHAPTER  1 


THE  PERFORMANCE  EVALUATION  PROBLEM 


1.1.  Motivation 

Almost  ten  years  have  passed  since  the  appearance  of  the  first 
general-purpose  time-sharing  computer  system  marked  by  the  CTSS  system 
l C9 J .  This  system  is  still  in  operation  at  M.I.T.  although  it  is  being 
replaced  by  later  time-sharing  computer  systems  because  of  its  limi¬ 
tations  on  supporting  sophisticated  users.  Many  other  systems  have 
been  developed  in  the  effort  to  correct  the  deficiencies  of  earlier 
systems.  Notably,  the  Multics  syst-m  [C10,C12],  the  successor  of  CTSS, 
anc  the  IBM  360  model  67  system  [A3, LI]  incorporated  many  elaborate 
ideas  into  their  design  to  efficiently  provide  sophisticated  users  with 
their  own  virtual  computers.  These  large-sc.^le  systems  have  stimulated 
the  development  of  a  number  of  smaller  systems  such  as  GE  235,  RCA 
Spectra  model  46.  CDC  3300,  DEC  PDP-10,  and  XDS  940  and  Sigma  7. 

As  pointec.  out  by  Rosen  [R2],  especially  large-scale  time-sharing 
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systems  (the  Mutlics  system  and  the  360/67  system)  have  fallen  far 
short  of  the  performance  anticipated  in  the  initial  design  stage.  The 
elaborate  ideas  incorporated  into  these  systems,  such  as  paging,  segmen¬ 
tation,  multiprogramming,  multi-processing,  and  memory  hierarchy, 
combine  to  create  an  enormously  complex  system  and  therefore  it  was 
hard  to  estimate  how  efficient  such  an  ambitious  system  would  be. 
Moreover,  it  was  difficult  to  estimate  the  impact  of  the  behavior  of 
sophisticated  user  programs  in  this  environment  upon  the  system.  In 
fact,  it  is  not  easy  even  now  to  predict  the  performance  (e . g ., through¬ 
put,  response  time,  etc)  of  such  complex  time-shared  computer  systems; 
it  is  generally  believed  that  the  most  accurate  prediction  of  the 
performance  of  a  system  in  question  is  obtained  by  extrapolating  the 
observed  performance  of  ''similar"  existing  systems.  But  a  little 
investigation  often  reveals  that  each  system  is  very  different  in  hard¬ 
ware,  software,  and  user  characteristics  from  others.  Therefore,  other 
existing  systems  do  not  directly  provide  us  with  a  reliable  performance 
projection  of  a  system,  although  an  examination  of  performance  differ¬ 
ences  of  similar  systems  may  be  very  interesting.  Some  examples  of 
difficult  questions  on  the  performance  of  these  modern  large-scale 
time-shared  computer  systems  are  listed  for  reference  in  Table  1-1. 

It  is  obvious  that  we  need  some  sort  of  comprehensive  theory  of 
performance  evaluation  th*t  is  capable  of  answering  these  quantitative 
performance  questions  concerning  hardware,  software,  and  user  character¬ 
istics  of  these  computer  systems. 


Table  1-1  Some  Difficult  Performance  Questions 


What  is  the  best  configuration  to  provide  time-sharing  s  rvice  for 
200  sophisticated  users  at  a  place  such  as  M.I.T.? 

What  is  the  maximum  number  of  users  that  can  be  supported  by  a  certain 
system  of  a  given  configuration,  without  discouraging  them  with  ex¬ 
cessively  slow  responses? 

What  are  the  future  procurement  plans  to  improve  the  throughput  of  a 
certain  system  if  that  system  must  evolve?  Should  we  purchase  another 
processor,  another  unit  of  primary  memory,  or  a  faster  secondary 
memory  device? 

How  sensitive  is  the  system  performance  to  the  changing  user  charac¬ 
teristics? 

Does  sharing  affect  the  cost/performance  of  a  given  system  very  much? 

How  much  does  multiprogramming  improve  the  performance  of  a  certain 
system? 

How  can  we  keep  the  system  performing  optimally  by  tuning  system 
parameters  when  a  certain  system  overhead  is  halved? 

What  is  the  optimum  page  size  for  a  proposed  system?  How  sensitive 
is  the  performance  to  the  page  size? 
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1.2.  Computer  Systems  to  be  Studied 

This  thesis  will  be  concerned  with  performance  evaluation  of  these 
complex  systems,  i.e.,  mult iprogrammed  vircual -memory  time-shared 
computer  systems  using  demand  paging.  Before  beginning  a  discussion  of 
the  performance  evaluation  problems,  a  brief  description  of  such  comput¬ 
er  systems  to  be  studied  is  given  in  this  section,  in  order  to  avoii 
ambiguities  associated  with  the  complex  structure  of  these  systems. 

A  typical  system  is  schematically  depicted  in  Figure  1-1.  The 
entire  system  is  composed  of  a  processing  system  and  a  finite  population 
of  interactive  users  at  their  terminals.  Each  interactive  user  of  the 
computer  system  thinks  for  a  while  and  then  requests  a  computation 
(hereafter  called  a  Job)  to  be  performed  by  the  processing  system,  by 
typing  a  command  line  at  his  terminal.  The  job  thus  requested  is  re¬ 
ceived  and  first  jiaced  in  the  memory  queue  by  the  processing  system. 

The  processing  system  is  assumed  to  have  more  than  one  processor, 
and  a  two-level  virtual-memory  consisting  of  multi-unit  primary  memory 
(usually,  core)  as  the  first  level  and  large  secondary  memory  as  the 
second,  as  shown  in  Figure  1-2.  This  multi-processor  multi-memory 
processing  system  is  assumed  to  have  a  two-dimensional  address  trans¬ 
lation  mechanism  using  both  segmentation  and  PaBin&  lD2l-  When  this 
processing  system  finds  some  space  in  primary  memory  that  can  be  allo¬ 
cated  to  a  new  user  job,  one  of  the  jobs  in  <^is  moved,  under  the 
First-Come  First-Served  (FCFS)  discipline,  to  the  processor  queue  Qp. 

The  jobs  in  are  scheduled  one  by  one,  (usually)  under  the  FCFS 
discipline,  for  a  processor’s  service  when  one  of  the  processors  becomes 

executed  (in  the  running  state  of 


available,  and  then  each  job  is 


Multiprogramming  of  Degree 
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Figure  1-1  Mult iprogranmed  Virtual-Memory  Time-Shared  Computer  System  Using  Demand  Paging 


Processors  Primary  Memory 
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Figure  1-2  Multi-Processor  Multi-Memory  Procersing  System 
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Figure  1-1)  by  the  procesror  until  it  encounters  a  part  (page),  of  its 

program,  that  is  missing  in  primary  memory  (  a  missing  page  fault  is 

said  to  have  occurred).  Control  of  the  processor  is  transferred  from 

the  user  job  to  a  supervisor  module,  named  the  page  fault  handler,  which 

then  requests  the  missing  page  to  be  brought  from  secondary  memory  for 

that  user  job,  under  a  demand  paging  strategy.  At  this  point,  the 

processor  becomes  available  for  another  job  in  Q  .  On  the  other  hand 

P  * 

the  page-faulted  user  job  enters  the  page-wait  state,  waiting  for  the 

requested  page.  When  it  is  eventually  transferred  from  secondary  memory 

to  primary  memory,  the  user  job  reenters  the  processor  queue  Q  (or  the 

P 

page-ready  state),  becoming  ready  for  another  service  by  a  processor. 
Because  there  usually  exist  several  jobs  competing  for  service  by 
processors,  each  job  is  executed  in  an  interleaved  fashion,  cycling 
through  the.  above  three  states  many  times.  When  the  job  is  eventually 
completed,  the  result  is  returned,  as  a  system  response,  to  the  terminal 
user  who  requested  this  job,  and  he  then  starts  thinking  about  which 
job  to  request  next. 

A  series  of  jobs  being  requested  by  a  user  is  generally  called  a 
process.  It  is  assumed  that  a  user  can  request  a  job  only  after  he 
receives  the  system's  response  to  his  preceeding  job  request.  If  a 
user's  job  is  located  within  the  section  of  Figure  1-1  surrounded  by  a 
broken  line,  his  process  is  raid  to  be  eligible  because  it  is  eligible 
for  a  processor's  service;  otherwise  it  is  said  to  be  ineligible. 

The  number  of  eligible  processes  existing  at  any  instant  should  be 
determined  by  considering,  at  least,  their  demand  for  primary  memory 
available  to  eligible  user  processes,  in  order  to  avoid  thrashing. 


19 


i.e.,  excessive  competition  for  primary  memory  space  leading  to  a  less 
than  optimum  use  of  system  resources.  The  computer  system  to  be  con¬ 
sidered  is  actually  assumed  to  have  a  simple  (static)  mechanism  to 
avoid  thrashing,  called  multiprogramming  of  degree  q,  which  allows  a 
maximum  of  q  (a  constant)  processes  to  be  simultaneously  eligible. 

Summing  up,  the  computer  system  to  be  studied  is  characterized  by 
the  parameters  shown  in  Table  1-2.  This  table  lists  only  major  system 
parameters  which  are  believed  to  be  important  in  performance  evaluation. 
The  precise  definition  of  these  system  parameters  will  be  given  as  they 
are  introduced  in  the  thesis. 
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Table  1-2  Major  System  Parameters 

System  Hardware: 

number  of  processors 

si2e  of  primary  memory 

number  of  primary  memory  units 

size  of  secondary  memory* 

speed  of  processors  and  primary  memory 

speed  of  secondary  memory 

channel  organization  and  capacity* 

memory  cycle  interference 

System  Software: 

scheduling  algorithm* 

multiprogramming  algorithm  (degree  of  multiprogramming) 

page  replacement  algorithm 

page  size 

paging  overhead 

miscellaneous  overheads 

data-base  lockout 

Users  and  Their  Programs: 

number  of  interactive  users 
types  of  interactive  users* 
user's  think  time 

execution  time  required  by  each  user  interaction  (job) 
program  size 

reference  pattern  of  programs 
decree  of  sharing  among  user  processes 


*  Comments  will  be  given  in  Section  1.6. 
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1.3.  Nature  of  the  Problem 

As  the  complexity  of  time-shared  computer  systems  grew,  the  perform¬ 
ance  evaluation  of  these  costly  systems  became  vital.  Therefore,  many 
performance  evaluation  techniques  have  been  developed.  Lucas  [L4] 
recently  classified  these  techniques,  according  to  their  purposes  of 
performance  evaluation,  into  the  following  three  categories. 

(A)  System  selection  techniques 

(B)  Performance  pro  lection  techniques 

(C)  Performance  monitoring  techniques 

The  first  category  of  techniques  is  intended  to  select  a  particu¬ 
lar  system  from  various  systems  available  from  many  manufacturers  when 
system  performance  is  a  major  criterion  to  make  a  purchase  order.  Lucas 
suggests  that  synthetic  programs  (a  comprehensive  set  of  benchmark 
programs,  so  to  speak)  are  most  appropriate  in  this  category.  The 
second  category  of  techniques  are  intended  to  estimate  the  performance 
of  a  system  that  does  not  yet  exist  or  only  partially  exists.  Within 
this  category,  Lucas  claims  that  simulation  is  most  powerful  because  of 
its  relative  flexibility  in  modeling  complex  systems.  The  last  category 
of  techniques  is  intended  to  collect  data  on  the  actual  performance  of 
an  existing  system.  These  data  are  used  to  identify  the  operating 
condition  of  the  system  so  as  to  forecast  the  impact  of  changes  in  the 
system,  possibly  with  the  help  of  the  techniques  of  the  second  category. 
Monitoring  uses  both  hardware  and  software  methods. 

According  to  this  classification,  the  performance  evaluation 
techniques  to  be  presented  in  this  thesis  belong  to  the  second  category 
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and  are  especially  concerned  with  the  following  three  general  performance 
problems  of  modern  large-scale  time  shared  computer  systems. 

(1)  performance  prediction  for  a  given  configuration 

(2)  performance  optimization  for  given  (hardware)  configuration 

(3)  configuration  selection  for  a  given  budget 

Performance  prediction  means  a  functional  expression  of  system 
performance  (e.g.,  throughput,  response  time,  etc.)  in  terms  of  various 
system  parameters  concerning  hardware,  software,  and  user  characteristics 
of  the  system;  a  performance  prediction  technique  estimates  the  perform¬ 
ance  of  a  given  (hardware-software-user)  configuration.  Performance 
optimization  deals  with  the  problem  of  how  to  improve  the  performance 
of  a  given  system  without  changing  its  hardware:  the  system  performance 
is  optimized  with  respect  to  certain  adjustable  parameters  of  the 
operating  system  such  as  the  degree  of  multiprogramming,  the  page  size, 
and  various  resource  allocation  algorithms,  without  changing  the 
hardware  cost.  Lastly,  configuration  selection^  is  the  problem  of 
deriving  the  optimum  system  configuration  which  attains  the  best 
performance,  constrained  by  a  given  purchase  budget;  this  involves 
an  optimization  of  hardware  configuration  as  well  as  that  of  the  operat¬ 
ing  system. 


^The  configuration  selection  problem  is  simpler  than  the  system  selection 
problem  because  we  need  not  be  concerned  with  various  differences  of 
uifferent  systems  in  machine  structure  such  as  word  length,  width  of 
data  transfer  paths,  machine  instruction  repertory,  etc. 
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Unfortunately,  simulation  approaches,  recommended  for  this  category 
of  pvobl“ms  by  Lucas,  require  enormous  amount  of  development  effort 
and  operating  cost.  This  becomes  especially  apparent  if  a  simulation 
model  includes  many  micro  and  macro  operations  in  evaluating  the  per¬ 
formance  of  various  possible  configurations  of  a  certain  system.  More¬ 
over,  simulation  approaches  tend  to  lack  the  capability  of  yielding  a 
general  insight  into  the  cause-and-ef feet  relationship  of  performance 
problems.  To  compensate  these  weaknesses  of  simulation  approaches,  this 
thesis  explores  Lhe  possibility  of  using  analytical  models  in  tackling 
the  above  three  general  problems  of  computer  system  performance. 
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1.4.  Review  jf  Analytical  Computer  Models 

Because  analytical  models  will  be  developed  in  this  thesis  to 
project  the  performance  of  large-scale  time-shared  comnot-er  systems, 
this  section  briefly  reviews  and  examines  the  analytical  computer  models 
developed  as  performance  projection  techniques  until  now. 

These  models  are  siochastic  in  nature  and  their  analysis  usua  ily 
.nvolves  queuing  theory.  Because  it  is  generally  difficult  to  include 
many  mutually  related  system  parameters  (see  Table  1-2)  in  a  mathemati¬ 
cally  tractable  model,  most  of  these  analytical  models  are  concerned 
with  subsystem  behaviors.  They  may  be  classified  into: 

(a)  processor  scheduling  models 

(b)  secondary  memory  models 

(c)  multiprogramming  models 

(d)  program  behavior  models 

Processor  scheduling  models  usually  include  a  single  processor,  an 
infinite  (sometimes,  finite)  user  population,  and  a  scheduling  algorithm 
(e.g.,  First-Come  First-Served,  Round-Robin,  Processor  jJharing,  Feed- 
Back).  and  aim  to  study  the  effect  of  a  scheduling  parameter  (e.g., 
quantum  length)  upon  response  time  of  a  ussr  job  (conditioned  upon  its 
execution  time).  These  models  are  most  abundant  among  the  above  four 
classes  and  were  extensively  surveyed  by  McKinney  [M2).  For  example, 
general  analyses  by  RR,  PS,  anc  FB  algorithms  were  presented  respective¬ 
ly  by  Chang  [C2],  Baskeit  [B2j,  and  Schrage  [S5). 

Secondary  memory  models  are  intended  to  estimate  the  (average)  time 
to  fetch  a  block  of  Information  from  a  secondary  memory  device  (e.g.. 
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disk,  drum,  bulk  core,  etc.)  used  as  part  of  virtual  memory.  Fewer 
papers  have  been  published  in  this  area.  Coffman's  work  [C5.C6’  is, 
however,  worthy  of  a  special  note.  He  derived  the  average  time  required 
to  fetch  a  page  of  information  from  a  sector  drum  under  demand  paging, 

using  ar,  embedded  Markov  chafn  technique  {C8). 

Multiprogramming  models  are  relatively  newer  class  cf  models 
were  analyzed  by  Smith  [S7).  Wallace  and  Masson  (W2),  dugen  IB6).  Moore 
(M3),  Rice  [Rl),  et  al.  These  models  combine  processor  scheduling 
models  (with  a  FCFS  discipline)  and  secondary  memory  models,  making  a 
significant  step  toward  a  "system  model".  They  include  several  (a  fixed 
number)  jobs  under  multiprogramming,  each  of  which  is  serviced  by  a 
processor  and  then  by  one  of  the  secondary  memory  devices  in  a  cyclic 
fashion,  us  schematically  shown  in  Figure  1-3.  Service  time  of  each 
server (processor,  I/O  devices)  is  usually  assumed  to  be  exponentially 
distributed  and  branchings  are  specified  by  constant  (unconditonal) 
probabilities  (e.g.,  pt  of  Figure  1-3).  This  class  of  models  aims  to 
evaluate  the  effect,  upon  the  server  utilizations,  of  the  hardware  con¬ 
figuration  as  well  as  of  the  number  of  jobs  under  multiprogramming, 
i.e.,  the  degree  of  multiprogramming.  Smith  and  Wallace  et  al  formulated 
their  problems  as  a  Markov  process  and  numerically  obtained  the  perform¬ 
ance  of  these  models  using  a  powerful  queue  analysis  program  called  RQA 
{Wlj.  On  the  other  hand.  Buzen,  Moore.  Rice,  and  others  explicitly 
analyzed  the  performance  of  their  multiprogramming  models  using  Gordon 
and  Newell’s  method  (G2]  or  Jackson's  decomposition  thorem  concerning 
queuing  networks  IC7.J1J.  The  effect  of  non-exponential  service  times 
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Figure  1-3  Typical  Multiprogramming  Model 
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was  examined  by  Gaver  {G1 ] -  The  problem  of  optimizing  the  degree  oi 
multiprogramming  under  a  simple  assumption  was  studied  by  Wallace  and 
Mason.  The  effect  of  system  overhead  was  carefully  analyzed  by  Lewis 

and  Schedler  [L2]. 

Finally,  there  exists  another  class  of  analytical  models,  i.e., 
program  behavior  models.  These  models  are  intended  to  study  the  paging 
behavior  of  programs  being  executed  by  a  processor  within  a  limited 
amount  of  primary  memory.  There  seem  to  be  at  least  two  different 
popular  approaches  in  modeling  program  behavior,  Markovian  program 
models  [KU  and  Denning's  working-set  program  models  [Dl,  D3  ] .  The 
paging  behavior  of  programs  is  modeled  by  a  Markov  chain  in  the  first 
approach,  while  the  paging  behavior  is  measured  by  the  number  of  dis¬ 
tinct  pages  referenced  during  a  given  time  interval  in  the  second 
approach.  Both  of  these  attempt  to  derive  a  success  function,  i.e., 
the  probability  that  a  reference  is  made  to  a  page  already  in  primary 
memory  expressed  as  a  function  of  the  primary  memory  size,  for  a  certain 
page  replacement  algorithm.  However,  no  practically  useful  results  have 
been  obtained  about  the  dynamic  paging  behavior  of  programs. 

Although  many  analytical  models  have  been  developed  and  analyzed, 
there  still  exists  a  general  consensus  especially  among  system  designers 
that  these  models  are  not  good  enough  to  answer  the  performance  questions 
that  they  face  (see  Table  1-1).  This  is  probably  due  not  only  the 
system  designers’  insufficient  efforts  to  try  to  use  studied  models  in 
understanding  the  behavior  of  an  actual  system,  but  also  to  certain 
important  detects  of  these  analytical  models;  most  of  the  analytical 
models  developed  until  now  have  serious  weaknesses  in  at  least  some  of 
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the  following  aspects. 

(1)  System  behavior  versus  subsystem  behavior;  almost  all  analyti¬ 
cal  models  attempt  to  represent  only  a  subsystem1,  and  this 

is  the  very  rtason  why  many  system  analysts  abandon  an  analyti¬ 
cal  approach  and  choose  a  costly  simulation  approach.  Perhaps, 
we  need  another  modal  which  is  capable  of  combining  all  sub¬ 
system  models  meaningfully  so  as  to  estimate  the  performance 
of  the  entire  system. 

(2)  Paging  activities  and  multiprogramming  capability;  this  aspect 
of  the  modern  large-scale  time-shared  computer  system  is  not 
considered  in  the  processor  scheduling  models,  but  is  usually 
considered  in  the  multiprogramming  models.  Therefore,  we 
should  favor  the  latter  models.  The  program  behavior  models 
developed  until  now  arc  not  good  enough  to  consider  a  multi- 
programmed  situation. 

(3)  Space-domain  considerations;  the  size  of  primary  memory  is  not 
considered  in  the  multiprogramming  models.  Moreover,  these 
models  consider  only  a  single-processor  system,  in  spite  of 

the  increasing  interest  in  multi-processor  multi-memory  systems. 

(A)  Overhead  considerations;  almost  all  models  do  not  consider 

the  overhead  of  system  programs,  although  a  major  implication 

There  are  some  analytical  models  of  an  entire  system  [S4,T1].  However, 
these  models  are  apparently  too  simple  to  reasonably  represent  multi- 
progranmed  time-shared  computer  systems  using  demand  paging. 


of  modern  time  shared  computer  systems  with  paged  memory  is 
considered  to  be  system  overheid. 

(5)  Good  choice  of  performance  measure;  the  improvement  of  utili¬ 
zation  factors  of  system  resources  is  not  an  ultimate  goal. 
Better  performance  measures  are  the  system  throughput,  namely, 
the  amount  of  user  computation  accomplished  by  the  entire 
system  per  unit  time,  and  the  system  response  time  experienced 
by  interactive  terminal  users. 

We  will  attempt  to  correct  these  defects  of  the  existing  analytical 
models  in  this  thesis  research  sc  that  we  can  realistically  attack  per¬ 
formance  prediction  problems,  performance  optimization  problems,  and 
configuration  selection  problems. 


30 


1.5.  Approach  to  be  Takpn 

Modeling  of  multiprogrammed  virtual-memory  time-shared  computer 
systems  using  demand  paging  must  include  many  system  parameters,  as  seen 
io  Table  1-2.  It  is  actually  this  multiplicity  of  interacting  system 
parameters  that  has  prevented  analytical  computer  models  from  being  a 
reasonable  representation  of  these  systems  a.  their  entirety;  a  straight¬ 
forward  inclusion  of  all  important  system  parameters  in  an  analytical 
model  would  not  allow  mathematical  traceability  in  Its  analysis.  Inter- 
kingly,  it  has  been  observed  also  in  simulation  approaches  that  a 
simulation  model  which  attempts  to  account  for  both  "macro-operations" 
(*.g.,  user's  think  time)  and  "micro-operations"  (e.g.,  state  transitions 
of  eligible  processes,  dynamic  program  behavior)  can  be  prohibitively 
p  nslve  to  run  [B4J.  On  the  contrary,  the  results  obtained  from  the 
oversimplified  analytical  models  cannot  convince  the  system  designers  of 
rheir  practical  value  as  a  design  aid.  Therefore,  we  are  in  a  dilemma  of 
mathematical  tract.bility  and  multiplicity  of  interacting  system  parameters. 

However,  a  solution  to  this  dilemma  may  be  found  by  noting  that  all 
system  parameter,  do  not  necessarily  interact  and  that  the  system 
behavior  involves  varied  .  activiti-s  of  different  time  scales.  For 
example,  user  behavior  (e.g.,  think  time,  Jobs  to  be  requested,  etc.) 
tend,  to  be  logically  independent  of  dynamic  program  behavior  unless 
the  user  1,  encouraged  to  be  particul.ry  careful  about  the  program 
behavior.  He  find  that  the  activities  of  the  entire  system  may  be 
categorized  into  "macro-activities"  (e.g.;  memory  queuing  of  user  Jobs, 
terminal  user  behavior— activities  on  the  order  of  seconds),  "micro- 
activities"  (e.g  ,  state  transitions  of  eligible  processes— millisecond 
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activities).  Thus,  we  can  obtain  several  semi- independent  activities 
of  the  entire  system  which  interact  only  through  their  prime  inter¬ 
relationships.  When  all  the  semi- independent  activities  are  connected 
according  to  their  prime  inter-relationships,  we  generally  obtain  hier¬ 
archically  organized  system  activities.  One  such  result  is  shown  in 
Figure  1-4.  The  approach  to  be  taken  in  analyzing  the  behavior  of  the 
entire  ccmpwter  system  in  this  thesis  is  to  develop  a  separate  model 
for  each  semi-independent  activity  of  this  hierarchy  in  such  a  way  that 
any  model  has,  as  its  input  parameters,  at  least  one  system  parameter 
produced  (as  an  output)  by  each  immediate  lower-level  model,  if  such 
models  exist,  as  well  as  its  own  set  of  system  parameters.  A  system 
parameter  produced  as  an  output  of  a  model  can  be  regarded  as  a  perform¬ 
ance  index  of  that  model.  If  such  a  performance  index  is  not  the  output 
of  the  top-level  model,  it  is  called  an  intermediate  performance  measure. 
Now  it  is  clear  that  the  entire  computer  system  is  represented  by  a  set 
of  hierarchically  organized  modular  models. 

We  will  use.  a  particular  hierarchy  of  modular  models  depicted  by 
Figure  1-4.  It  includes  the  following  five  modular  models. 

(1)  program  behavior  model(s) 

(2)  secondary  memory  model 

(3)  user  behavior  model 

(4)  processor  model 

(5)  total  system  model 

The  program  behavior  model(s),  to  be  developed  for  the  behavior  of 
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Figure  1-4  Hierarchical  Organization  of  Modular  Models 
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programs  being  executed  on  the  processing  system,  aim(s)  to  derive  the 
mean  length  of  program  execution  before  a  page  fault  (that  is,  the  mean 
time  between  page  faults  denoted  as  "mtbpf")  as  a  function  of  various 
system  parameters  such  as  program  characteristics,  the  primary  memory 
size,  the  page  replacement  algorithm  being  used,  the  page  size,  the 
degree  of  sharing  among  eligible  processes,  the  degree  of  multipro¬ 
gramming,  etc.  The  secondary  memory  model  aims  to  derive  the  mean 
length  of  time  required  to  fetch  a  page  of  information  from  the  second¬ 
ary  memory  system  (that  is,  the  mean  page  fetch  time  denoted  as  "mpft") 
as  a  function  of  memory  device  characteristics,  the  channel  ..  'ganization, 
the  input/output  traffic  intensity,  etc.  Because  the  performance  of  the 
secondary  memory  system  (disk,  drum,  bulk  core,  or  a  combination  of 
these)  can  be  reasonably  predicted  by  a  simple  queuing  model  like  an 
M/G/l  queue  [C8],  or  by  the  fairly  complex  drum  model  of  Coffman  [C5, 

C6],  the  thesis  will  not  develop  a  new  model  in  this  area.  The  user 
behavior  model  exists  for  a  user's  function  to  think  and  request  a 
computation  called  a  user  job.  In  this  thesis,  we  simply  assume  that 
think  time,  which  a  user  needs  to  decide  and  request  a  job,  and  exe¬ 
cution  time  required  by  such  a  user  job  are  both  exponentially  distrib¬ 
uted  because  they  are  known  to  be  roughly  exponential  [S4];  we  will  not 
be  concerned  with  any  internal  mechanism  of  these  interactive  users 
(e.g.,  under  which  condition  a  user's  think  time  tends  to  be  exponential). 

Immediately  above  the  program  behavior  model  and  the  secondary 
memory  model,  there  exists  the  processor  model.  This  is  a  multipro¬ 
gramming  model  improved  in  such  a  way  that  the  performance  of  a  multi- 
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programming  mechanism  implemented  on  a  multi-processor  multi-memory 
processing  system  (see  Section  1.2.)  can  be  measured  in  percentile 
-throughput,  i.e.,  the  percentage  of  the  system's  computational  capacity 
utilized  for  users'  useful  work.  This  means  that  the  objective  of  the 
processor  model  is  to  derive  the  percentile  throughput  of  the  system  as 
a  function  of  the  number  of  processors  and  primary  memory  units,  their 
operating  speed,  the  speed  of  secondary  memory,  various  overheads  of  the 
operating  system,  the  degree  of  multiprogramming,  user  process  behavior, 
multi-processor  interferences  (such  as  memory  cycle  interference  and 
data-base  lockout),  and  so  on.  Particularly  as  for  the  user  process 
behavior  and  the  speed  of  secondary  memory,  it  will  be  assumed  that  the 
time  between  page  faults  and  the  page  fetch  time  are  both  exponentially 
distributed  with  mians  predicted  by  the  analyses  of  the  program  behavior 
model  and  the  secondary  memory  model  respectively.  Because  percentile 
throughput  turns  out  to  be  linearly  proportional  to  the  throughout  of 
the  processing  system,  i.e.,  the  average  number  of  user  jobs  completed 
per  unit  time  (this  relation  will  be  shown  later  in  Chapter  3),  the  der^ 
Ivation  of  percentile  throughput  using  the  processor  model  is  called  a 
.throughput  analysis.  The  throughput  analysis  using  the  above  multi¬ 
programming  model  aims  to  overcome  the  defects  of  the  existing  multi¬ 
programming  models  concerning  the  space-domain  considerations,  the 
overhead  considerations,  and  the  performance  measure. 

At  the  top  level,  there  exists  the  total  system  model.  This  model 
combines  all  other  models  in  the  hierarchy,  and  aims  to  derive  a  distri- 
bution  of  the  system's  response  time  as  an  explicit  function  of  the 
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number  of  processors,  the  number  of  interactive  users,  the  percentile 
throughput  predicted  by  the  processor  model,  and  the  user  characteris¬ 
tics  such  a^  user  think  time  and  execution  time  of  a  user  job  both  of 
which  are  assumed  to  be  exponentially  distributed.  This  means  that  it 
will  be  possible  to  derive  the  percentile  response  time,  i.e.,  the  time 
limit  which  guarantees  that  a  certain  proportion  (e.g.,  90  percent)  of 
response  times  is  shorter  than  that  limit,  as  a  function  of  various 
system  parameters  describing  the  configuration  of  the  computer  system. 
Therefore,  the  result  to  be  presented  is  much  more  informative  than  the 
result  of  the  average  response  time,  because  the  fluctuations  of  re¬ 
sponse  times  around  the  average  can  be  accurately  predicted.  It  seems 
that  this  is  the  first  derivation  of  the  distribution  of  response  time 
of  a  multiprogrammed  time-shared  computer  system  using  demand  paging. 

It  should  be  pointed  out  that  the  response  time  and  the  system 
throughput  predicted  in  this  approach  reflect  not  only  processor  hard¬ 
ware  characteristics,  system  software  (operating  system)  characteristics, 
and  user  behavior  characteristics,  but  also  secondary  memory  character¬ 
istics  and  program  behavior  characteristics.  This  was  made  possible  by 
the  use  of  a  set  of  hierarchically  organized  modular  models  each  of 
which  involves  only  a  few  system  parameters,  while  the  entire  computer 
system  includes  more  than  twenty  important  system  parameters.  This 
thesis  will  consider  all  the  system  parameters  of  Table  1-2  except  those 
starred,  in  evaluating  the  performance  of  multiprogrammed  virtual-memory 
time-shared  computer  systems  using  demand  paging,  explained  in  Section 


1.2. 
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Aside  from  the  fact  that  this  hierarchical  organization  permits 
mathematical  tractability  and  the  existence  of  such  a  multiplicity  of 
interacting  system  parameters,  it  has  the  following  equally  important 
aspects . 

(1)  This  organization  allows  human  understanding  of  the  very 
complicated  behavior  of  modern  complex  computer  systems  and 
therefore  provides  us  with  an  insight  into  the  cause-and- 
effect  relationships  existing  in  such  systems. 

(2)  If  the  behavior  of  a  subsystem  becomes  known  through  the 
experimental  or  monitoring  studies  of  a  partially  existing 
system,  the  behavior  of  the  corresponding  model  can  be  re¬ 
placed  by  the  actual  behavior  of  that  subsystem.  Therefore, 
available  information  about  subsystems  can  be  usefully  util¬ 
ized  in  predicting  the  performance  of  the  entire  system  with 
an  increased  accuracy. 

(3)  Change  in  subsystem  configuration  (e.g.,  secondary  memory 
configuration),  expected  especially  on  evolving  computer 
systems,  does  not  require  an  overall  change  of  the  entire 
analysis;  only  the  parameters  of  that  subsystem  need  to  be 


modified . 
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1.6.  Some  Comments  about  the  Models 

Comments  may  be  in  order  as  to  why  some  of  the  system  parameters 
of  Table  1-2  are  out  of  consideration  in  this  thesis.  In  studying  the 
paging  activities  under  multiprogramming,  this  thesis  assumes  a  two- 
level  virtual  memory  consisting  of  primary  and  secondary  memories.  In 
this  context,  the  key  parameter  of  secondary  memory  is  not  its  size  but 
its  speed,  and  therefore  it  is  simply  assumed  in  the  thesis  that  the 
secondary  memory  system  is  large  enough  to  stci'e  all  the  system  and 
user  programs;  we  do  not  explicitly  consider  the  size  of  secondary 
memory.  As  for  the  channel  considerations,  it  will  be  assumed  for 
simplicity  of  the  analysis  that  the  processing  system  is  not  channel- 
limited.  Therefore,  the  dynamic  queuing  delay  associated  with  channel 
service  will  noc  be  explicitly  considered,  but  some  fixed  service  time 
by  channel  may  be  included  as  part  of  the  page  f«tch  time  of  secondary 
memory . 

Scheduling  algorithms  of  the  system  described  in  Section  1.2  are 
all  FCFS  and  any  alternative  algorithm  will  not  be  considered.  But,  as 
is  well  known,  actual  systems  usually  employ  more  sophisticated  algo¬ 
rithms.  For  example,  the  Multics  system  usee  a  refined  variation  of  FB 
scheduling  algorithm  [01]  with  a  pre-paging  and  post-purging  technique 
(rather  than  strict  demand  paging)  for  jobs  in  the  memory  queue.  It 
should  be  realized  that  what  we  want  to  accomplish  in  this  thesis  is  to 
develop  a  performance  evaluation  methodology  which  is  capable  of  de¬ 
riving  percentile  throughput  (or  system  throughput)  and  response  time 
of  a  "reasonably  basic"  system  like  the  one  described  as  a  typical 
system  in  Section  1.2,  as  a  function  cf  various  important  system  parame 
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ters.  Then,  it  becomes  clear  that  inclusion  of  an  alternative  sched¬ 
uling  algorithm  like  a  RR  or  FB  algorithm  (for  jobs  in  the  memory 
queue)  in  the  proposed  framework  of  the  performance  evaluation  theory 
is  not  very  attractive,  because  it  is  known  in  queuing  theory  [B2,C7] 
that  (overheadless)  scheduling  algorithms  cannot  affect  percentle 
throughput  or  average  response  time  if  the  execution  time  of  user  jobs 
is  only  probabilistically  known  and  follows  an  exponential  distribution, 
as  assumed  in  Section  1.2  and  as  observed  on  the  Multics  system,  for 
example;  scheduling  algorithms  however  tan  affect  the  variance  of 
response  time  because  of  their  Intended  favoritism  for  certain  (e.g., 
short)  jobs  at  the  expense  of  others  (C7).  Percentile  throughput  and 
average  response  time  only  deteriorate  if  the  process  (job)  switching 
overhead  associated  with  quantum  run-out  of  the  alternative  scheduling 
algorithm  is  not  negligible,  (Note  that  the  introduction  of  pre-paging 
and  post-purging  aainly  alma  to  reduce  this  overhead.)  Therefore,  in 
making  a  hard  effort  to  include  as  many  "Important"  system  parameters 
(those  which  affect  percentile  throughput  c-  average  response  time) 


lUnder  a  RR  algorithm,  for  example,  a  fixed  amount  of  time,  called  a 
quantum.  Is  given  to  each  process  becoming  eligible.  The  process  is 
allowed  to  remain  eligible  until  it  uses  up  this  amount  of  processor 
time.  If  this  process  needs  more  processor  time  to  complete  its  job, 
it  becomes  ineligible  and  joins  the  end  of  the  memory  queue.  This  state 
transition  path  is  not  included  in  Figure  1-1,  because  we  assume  a  FCFS 
algorithm,  i.e.,  the  RR  algorithm  with  a  infinitely  large  quantum. 
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as  possible  in  the  framework  of  this  approach,  .  lternative  scheduling 
algorithms  will  not  be  considered;  such  a  refinement  may  be  applied 
after  the  computer  system  undet  study  is  optimized  with  respect  to 
percentile  throughput. 

For  Jobs  in  the  processor  queue,  the  Multlcs  system  uses  a  pre¬ 
emptive  scheduling  algorithm  (to  accomplish  a  biased  primary  memory 
allocation  [B3])  with  a  dynamic  eligibility  control  mechanism  based  on 
a  working-set  estimate  of  user  processes  [01]  (rather  than  the  static 
eligibility  control  mechanism  of  Section  1.2).  This  elaboration  does 
improve  percentile  throughput,  but  we  consider  it  as  a  refinement  of 
the  FCFS  scheduling  (i.e.,  an  unbiased  primary  memory  allocation)  with 
the  static  eligibility  control  mechanism.  Therefore,  these  details  will 
not  be  considered  in  this  thesis. 

Finally,  it  should  be  mentioned  that  many  random  variables  are 
assumed  to  be  exponentially  distributed  not  simply  because  of  mathemati¬ 
cal  amenability  but,  more  importantly,  because  of  experimental  evidence. 
The  user's  think  time  measured  by  Scherr  [S4»]  is  roughly  exponential, 
and  the  execution  time  of  user  Jobs  being  monitored  by  a  built-in  meter 
of  the  Multlcs  system  is  almost  exponential.  Page  fetch  time  tends  to 
be  close  to  an  exponential  distribution  especially  if  secondary  memory 
consists  of  a  combination  of  a  frequently-used  high-speed  device  (e.g., 
drum)  and  a  less- frequently-used  low-speed  device  (e.g.,  disk).  The 
distribution  of  time  between  page  faults  has  not  been  measured  on  the 
Multlcs  system,  but  it  is  very  probable  that  this  distribution  appears 
like  an  exponential  distribution.  These  observations  of  MIT  systems, 
of  course,  do  not  necessarily  ’essen  the  importance  of  extending  the 
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exponential  assumptions  to  more  general  ones.  In  fact,  the  execution 
time  of  user  jobs  of  some  other  systems  is  reported  to  be  more  like  a 
hyper-exponential  distribution  or  a  Weibull  distribution,  possessing  a 
large  coefficient  of  variation  (A2). 
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1.7.  Thesis  Organization 

This  thesis  takes  a  bottom-up  approach  in  describing  the  models 
developed  for  hierarchically  organized  subsystems.  Each  chapter  of  this 
thesis  presents  results  obtained  for  a  different  subsystem. 

Chapter  2  is  devoted  to  the  studies  of  dynamic  program  behavior, 
and  presents  several  program  behavior  models  evaluating  mtbpf  as  a  func¬ 
tion  of  various  system  parameters.  Chapter  3  is  dedicated  to  the 
studies  of  the  processing  system,  and  presents  a  result  of  the  through¬ 
put  analysis  using  the  processor  model,  i.e.,  a  i.,odel  of  the  multi¬ 
programming  mechanism  implemented  on  the  multi-processor  multi-memory 
processing  system.  Chapter  4  is  devoted  to  the  studies  of  the  response 
time  characteristics  of  the  entire  computer  system,  and  presents  the 
result  of  the  response  time  analysis  using  the  total  system  model 
formulated  as  a  queuing  process.  Thus,  Chapter  2  through  Chapter  4 
collectively  consider  the  performance  prediction  problems,  i.e.,  the 
problems  of  expressing  computer  system  performance  as  a  function  of 
various  system  parameters  describing  its  configuration. 

Chapter  5  presents  many  numerical  results  obtained  by  these 
analytical  models  as  well  as  the  actual  performance  data  collected  from 
the  Multics  system  of  M.I.T.  The  validity  of  these  models  is  first 
examined  and  then  the  effect  of  various  system  parameters  upon  the  over¬ 
all  system  performance  (i.e.,  the  system  throughput  and  the  system 
response  time)  is  numerically  evaluated  using  these  models.  The  problem 
of  optimizing  a  given  computer  system  with  respect  to  certain  operating 
system  parameters  (the  degree  of  multiprogramming  and  the  page  size)  and 
the  problem  of  deriving  the  best  performance  system  for  a  given  budget 
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are  realistically  considered.  This  chapter  tackles  all  the  performance 
evaluation  problems  mentioned  in  Table  1-1  numerically. 

The  last  chapter,  i.e..  Chapter  6,  summarizes  the  problems  solved 
in  this  thesis  and  those  that  still  remain  to  be  solved.  Throughout  the 
thesis,  a  stochastic  modeling  approach  is  used.  This  thesis  therefore 
represent  an  application  of  some  known  techniques  (or  their  slight 
variations)  in  Markov  process  theory  and  queuing  theory  to  performance 
evaluation  problems  of  computer  systems.  The  emphasis,  however,  is  on 
the  identification  of  the  actual  performance  problems  and  the  develop¬ 
ment  of  a  framework  of  performance  evaluation  methodology. 

A  reader  who  is  intere  ted  simply  in  finding  out  the  application  of 
the  framework  of  performance  evaluation  developed  in  this  thesis  to  the 
performance  problems  of  actual  computer  systems  is  suggested  to  read 
Chapter  5  immediately  after  this  chapter.  If  a  reader  is  interested  in 
understanding  modeling  techniques  for  computer  systems,  he  should  read 
Chapter  2  through  Chapter  4  carefully.  Each  of  these  three  chapters  can 
be  read  individually  without  much  troublp.  If  he  decides  to  read  in 
this  way,  the  notation  and  glossary  included  at  the  end  of  this  thesis 
may  be  helpful.  Chapter  6  is  useful  to  those  who  are  searching  for 
research  topics  in  the  area  of  computer  system  performance  evaluation. 


CHAPTER  2 


PROGRAM  BEHAVIOR  ANALYSIS 


2.1.  Introduction 

Virtual  memory  computer  systems  have  enabled  the  memory  system  to 
appear  to  their  users  (programmers)  as  if  it  is  virtually  infinite  in 
size.  Therefore,  the  nasty  problem  of  carefully  overlaying  programs 
within  a  relatively  limited  primary  memory  space  has  been  removed  from 
the  user's  programming  considerations.  Virtual  memory  is  assumed, 
throughout  this  thesis,  to  be  implemented  by  "segmentation"  and  "paging" 
(D2),  on  a  two-level  physical  memory  system  consisting  of  primary  memory 
and  secondary  memory. 

These  computer  systems  have  a  mechanism  for  translating  program¬ 
generated  addresses  into  the  correct  physical  memory  addresses  [D2]. 

The  set  of  program-generated  addresses  is  called  the  virtual  address 
space  (virtual  memory)  and  the  set  of  physical  memory  addresses  the 
physical  address  space  (physical  memory).  Segmen  tat  ion  organizes  the 
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virtual  address  space  into  blocks,  called  segments,  of  arbitrary  size. 

By  allocating  each  program  to  its  own  segment,  programs  can  have  their 
own  "linear"  virtual  address  space  within  themselves.  This  means  that 
a  processor  accesses  euch  word  of  a  program  residing  at  a  certain  loca¬ 
tion  of  the  virtual  address  space,  using  a  two-component  address  (two- 
dimensional  address)  consisting  of  a  segment  name  (segment  number)  and 
a  word  name  (word  number).  Paging  further  organizes  each  segment  into 
blocks,  called  pages ,  of  a  fixed  size  (usually  1,024  or  512  words). 

This  means  that  a  word  name  is  represented  by  a  page  number  and  an 

offset.  Each  segment  usually  has  several  pages  of  information  (  proce¬ 
dure  and  data). 

Correspondingly,  the  physical  memory  is  organized  into  equal-size 
blocks  of  locations,  known  as  page -frames,  which  serve  as  sites  of 
residence  for  pages  of  segments.  Because  a  processor  can  execute  only 
that  portion  of  a  program  (segment)  which  resides  within  primary  memory 
which  is  relatively  limited  in  size,  the  operating  system  must  exercise 
a  special  algorithm,  called  a  paging  algorithm,  to  keep  only  the  pages 
being  needed  for  a  progress  of  program  execution  in  primary  memory  at 
all  times,  by  transferring  pages  of  the  program  back  and  forth  between 
primary  and  secondary  memories.  The  paging  algorithm  decides  when  to 
fetch  a  page  from  secondary  memory  and  which  page  to  be  removed  from 
primary  memory  when  one  of  the  pages  in  there  must  be  replaced  by  the 
page  to  be  fetched.  If  a  page  is  fetched  only  on  demand,  i.e.,  only 
after  that  page  is  found  "missing"  in  primary  memory  in  the  course  of 
program  execution  (a  page  fault  is  said  to  have  occurred),  then  the 
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fetch  rule  is  said  to  be  demand  paging;  otherwise,  it  is  said  to  be  pre¬ 
paring.  The  rule  used  to  select  a  page  for  removal  is  called  a  page 
replacement  algorithm.  These  two  functions  (page  replacement  and  fetch¬ 
ing)  of  the  paging  algorithm  aro  carried  out  by  a  supervisor  program, 
named  a  page  fault  handler,  which  is  a  part  of  the  "resident"  supervisor. 

In  fact,  the  processing  of  a  page  fault  requires  some  other  things 
such  as  the  bookeeping  of  the  page  table,  the  initiation  of  a  channel 
program,  the  handling  of  a  paging  interrupt,  etc.  [ Cl 1 ] .  These  super¬ 

visory  operations  required  to  process  a  page  fault  are  collectively 
called  the  paging  overhead.  From  the  above  explanation,  it  should  be 
clear  that  each  burst  of  continuous  program  execution,  i.e.,  the  running 
state  of  each  elegible  process  (see  Section  1.2.),  consists  of  (at  least) 
user  program  execution  and  paging  overhead  execution.  The  length  of 
thi:  continuous  program  execution  in  the  running  state  is  called  a  time 
between  page  faults  (abbreviated  as  tbpf)  and  the  length  of  user  program 
execution  within  tbpf  is  called  a  headway  between  page  faults  (abbrevi¬ 
ated  as  hbpf ) .  The  means  of  these  variables  are  respectively  called  the 
mean  time  between  page  faults  (mtbpf )  and  the  mean  headway  between  page 
faults  (mhbpf ) . 

As  explained  in  Section  1.2.,  the  computer  system  to  be  studied  is 
under  multiprogramming  of  degree  q.  This  means  that  q  eligible  (user) 
processes  compete  for  service  by  a  processor  and  for  the  use  of  page- 
frames  of  primary  memory.  Because  segmentation  enables  any  segment  to 
b»  shared  (a  single  copy  of  a  segment  in  virtual  memory  can  be  simulta¬ 
neously  used  by  different  users),  some  of  these  page-frames  are  physi¬ 
cally  shared  by  these  eligible  processes.  Therefore,  the  efficiency  of 
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primary  memory  is  heightened  by  segmentation.  However,  some  part  of  the 
primary  memory  space  is  not  available  to  non-resident  program,  (programs 
whcih  are  not  always  resident  in  primary  memory)  invoked  by  eligible 
processes,  because  resident  supervisor  programs  stay  at  certain  physical 
memory  addresses  at  all  times.  This  situation  is  schematically  depicted 
in  Figure  2-1.  The  total  primary  memory  space  with  M  page-frames  is 
divided  into  two  areas,  i.e.,  the  area  with  mr  page-frames  for  resident 
supervisor  programs  and  the  area  with  mn  page-frames  for  non-resident 
programs  (user  programs  and  non-resident  supervisor  programs).  Eligible 
user  processes  execute  not  only  non-resident  programs  but  also  resident 
(supervisor)  programs,  and  therefore  they  can  potentially  utilize  any 
part  of  the  primary  memory  space. 

Dynamic  paging  behavior  of  user  processes  in  this  environment  is 
known  to  have  a  great  impact  on  the  system's  overall  performance  [k2] 
and  therefore  it  has  been  extensively  studied  by  many  researchers.  It 
is  believed  that  at  least  the  following  system  parameters  are  significant 
in  determining  the  performance  of  programs,  which  is  usually  measured  by 
the  mhbpf  of  user  processes. 

(1)  program  characteristics  (size,  reference  pattern) 

(2)  primary  memory  size  (m  ,  ra  ,  M) 

r’  n 

(3)  paging  algorithm 

(4)  page  size 

(5)  degree  of  sharing  among  user  processes 

(6)  degree  of  multiprogramming  (q) 


Be  lady  [B3],  Brawn  and  Gustavson  [B5],  Coffman  and  Varian  [C4],  Hatfield 


resident 


non-resident  programs 


supervisor 


invoked  by  q  user  processes 


M  =  m  +  m 
r  n 


Figure  2-1  Allocation  of  Primary  Memory  Space 
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and  Gerald  [Hi],  Tsao,  Comeau,  and  Margolin  [T2],  and  Baer  and  Sager 
[Bl],  among  others,  have  experimentally  investigated  the  effect  of 
various  sysiem  parameters  upon  the  mhbpf  (or  its  equivalent)  of  user 
processes.  On  the  other  hand,  Denning  [D1,D3],  Mattson,  Gecsei,  Slutz 
and  Traiger  [Ml],  Chang  [Cl],  Aho,  Tanning  and  Ullman  [Al],  King  [Kl], 
Woolf  [W3],  and  others  have  carried  out  analytical  studies  of  the  dynamic 
behavior  of  user  processes. 

Research  performed  by  Denning,  Woolf,  and  King  represents  three 
different  analytical  (probabilistic)  approaches  to  the  problem  consider¬ 
ed  in  this  chapter,  and  therefore  some  comments  about  their  results  are 
in  order.  Denning  defined  the  working  set,  W(t, t),  of  a  user  process  at 
time  t  to  be  the  set  of  pages  that  the  process  has  referenced  during  the 
time  interval  (t-T,  t),  and  he  has  demonstrated  the  usefulness  of  his 
model  using  the  working  set  as  an  aid  for  guiding  one's  intuition  in 
understanding  program  behavior  and  possibly  as  a  basis  for  further 
program  behavior  analysis.  Woolf  considered  program  behavior  to  be  an 
execution  of  a  series  of  loops  and  successfully  evaluated  the  effects  of 
program  characteristics,  primary  memory  size,  and  page  sire.  However, 
this  elaborate  model  has  not  been  validated  against  actual  program 
behavior.  On  the  other  hand,  King  formulated  program  behavior  as  a 
Markov  process  in  evaluating  the  effect  of  paging  algorithms  as  well  as 
those  of  program  characteristics  and  primary  memory  size.  These  analyti¬ 
cal  studies  do  explain  various  behaviors  of  programs  experimentally 
observed,  but  none  of  these  has  proved  to  serve  as  a  practical  design 
aid  to  predict  a  program's  quantitative  performance  (e.g.,  mhbpf)  as  a 
function  of  the  system  parameters  mentioned  above.  In  particular,  the 


effects  of  sharing  and  multiprogramming  have  never  been  fully  studied. 

In  the  following  sections  of  this  chapter,  four  program  behavior 
models  mostly  using  a  Markov  process  approach  will  be  developed  to  study 
the  effects  of  those  six  system  parameters  mentioned  above  upon  the 
mhbpf  (or  its  equivalent)  of  a  program  (a  user  process).  All  of  these 
four  models  consider  at  least  the  effect  of  program  characteristics  and 
of  primary  memory  size,  but  each  of  them  concentrates  on  the  evaluation 
of  the  effect  of  a  particular  system  parameter.  The  models  to  be 
presented  in  Sections  2.2,  2.3,  and  2.4  are  especially  concerned  with 
the  effect  of  the  paging  algorithm  in  use,  the  effect  of  the  primary 
memory  size,  and  the  effect  of  the  page  size,  respectively.  The  model 
to  be  presented  in  Section  2.5  considers  a  multiprogramming  environment 
and  is  particularly  concerned  with  the  effect  of  the  degree  of  multi¬ 
programming  and  that  of  sharing  among  user  processes  under  multi¬ 
programming. 

Readers  who  are  only  peripherally  interested  in  the  analysis  of 
program  behavior  may  skip  Sections  2.2,  2.3,  and  2. 4, and  directly  move 
to  Section  2.5,  which  describes  a  macroscopic  paging  performance  model 
for  multiprogramming.  The  result  of  this  section  will  be  extensi'.  el.- 
used  in  the  succeeding  chapters.  Sections  2.2,  2.3,  and  2.4  present 
detailed  analyses  of  dynamic  program  behavior  and  can  be  read 
individually  without  difficulty.  However,  a  reader  can  grasp  the  basic 
approach  used  in  this  chapter  more  thoroughly  by  reading  then  m  me 


order  of  these  sections. 
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2.2.  First-order  Markov  Model  Applied  to  the  PRA  Studies 

In  Section  2.2.,  we  present  a  program  behavior  model  developed  to 

study  the  effect  of  £age  replacement  algorithms  (abbreviated  as  PRAs) 

as  well  as  those  of  program  characteristics  and  primary  memory  size, 

upon  the  mhbpf  of  a  user  process  operating  on  a  multiprogrammed  virtual 

memory  time-shared  computer  system  using  demand  paging1.  To  simplify 

the  analysis,  we  assume  that  the  primary  memory  space  (m  )  is  partition- 

n 

£d  into  q  equa!,-size  areas,  one  for  each  eligible  process,  and  that  the 
PRA  under  study  is  exercised  within  the  area  reserved  for  each  user 
process  (this  kind  of  PRA  is  called  a  local  PRA  as  opposed  to  a  globa 1 
PRA  [D2 ] ) .  Then,  the  analyses  of  program  behavior  under  multiprogramming 
reduces  to  that  of  program  behavior  under  uniprogramming. 


2.2.1.  Program  Behavior  Model  and  Paging  Algorithms 

Consider  a  program  consisting  of  a  set  P={p1,P2> . . .,Pn)  of  n  pages. 
As  the  program  is  executed  by  a  processor,  it  generates  a  sequence  rjr2 
■ • • rt • • •  of  references  to  pages,  where  if  rt=Pi  we  say  that  the  program 
references  page  pt  at  the  t-th  reference  (or  at  time  t).  We  assume 
that  such  a  sequence,  known  as  a  page  reference  string,  is  generated  by 
a  probabilistic  law  associated  with  the  program.  In  particular,  if  r  j 
depends  only  on  , .  . . , r^  ,  then  the  probabilistic  behavior  of  the 


The  result  described  in  Section  2.2.  can  be  regarded  as  an  extension 
of  King's  work  [Kl],  although  this  was  independently  derived.  He  used 
a  zero th-order  Markov  model  for  exactly  the  same  purpose. 
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program  forms  a  k-th  order  Markov  chain.  This  Markov  chain  actually 
can  be  reduced  to  a  first-order  Markov  chain  of  n  states  by  considering 
a  direct  product  (P  )  of  the  set  P,  i.e.,  a  state  augmentation  techenique 
[H2 ] .  Therefore,  we  will  consider,  without  loss  of  generality,  only 
programs  that  are  modeled  by  a  first-order  Markov  chain. 

The  probabilistic  behavior  of  these  programs  is  characterized  by  a 
transition  matrix  P  of  the  following  form  [H2],  where  an  element  of  the 


matrix  p^j  represents  a  conditonal  probability  of  a  reference  to  a 
page  Pj  at  time  t+1,  given  that  a  page  pi  was  referenced  at  time  t. 


tX+l  P!  P2  ‘••Pn 

P1  P11  P12  Pln 

p2  P21  P22  ’**  P2n 

P  =  ...  .  (3.2.1) 

“*  •  •  •  • 

•  •  •  • 

pn  Pnl  Pn2  * ’ ’  Pnn 


We  assume  that  the  elements  of  this  matrix  do  not  depend  on  time  t,  i.e., 
the  program  behavior  is  stationary* .  Hereafter,  we  call  this  type  of 
program  a  (stationary  first-order)  Markovian  program. 

Suppose  that  primary  memory  consists  of  a  set  M={ f fj , . •  • ,  fm}  of 
m  page- frames  where  m£n.  Although  the  most  general  paging  algorithm 
may  be  the  one  which  fetches  (replaces)  an  arbitrary  number  of  pages  at 
an  arbitrary  instant  of  time,  we  will  consider  the  algorithms  >~hich 

* Lewis  and  Yue  [L3]  have  pointed  out  that  program  behavior  represented 
in  (LRU  stack)  distance  reference  strings  rather  than  page  reference 
strings  tends  to  be  more  stationary. 
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fetch  (replace)  only  one  page  at  an  instant  of  a  page  fault,  i.e.. 
demand  paging  algorithms.  In  this  case,  the  studies  of  paging  algorithms 
reduce  to  those  of  PRAs. 

With  this  much  of  background,  we  can  formally  describe  a  PRA  for  M 
and  P,  as  follows. 

Definition1  A  page  replacemant  algorithm  for  M  and  P  is  a  five-tuple 
system  E=  (S,I,0,f,g)  where 

(1)  S  Is  a  finite  set  of  system  states  and  is  represented  by  a  direct 

product  of  finite  sets  (S  S  ,S  )  of  PRA  control  states,  memory 

amp  1 

states,  and  program  states,  i.e.. 


S  a  S  x  S  x  S  , 
a  m  p 


where 

(a)  S  is  a  finite  set  of  PRA  control  states,  S  *  {  s  }  ,  which 

a  a  a 

is  defined  separately  for  each  PRA  later, 

(b)  Sra  is  a  finite  set  if  memory  states,  indicating  a  set  of  pages 
resident  in  primary  memory,  such  that 


S  *  (s  |  s  c  P,  |  s  I  £  m}  , 
m  m  m  1  m1  ’ 


(c)  5^  is  a  finite  set  of  program  states, 

V  {sp  |spc  P)  'SP^  l}- 

such  that  if  8^=  {  p^}  then  page  p^  of  the  program  is  being 
referenced, 


This  definition  is  similar  to  that  of  Aho  et  al  [A1J  or  King  (KlJ. 
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(2)  I  is  a  finite  input  alphabet  which  contains  all  the  elements  of 
page  reference  strings,  i.e., 

I  «  {Pt  I  Pt  ®  *  )  £  P, 

(3)  0  is  a  finite  output  alphabet 


0  »  {O.lj 

whose  meaning  is  defined  below, 

(4)  f  is  a  next  state  function  of  the  mapping 

S  x  1  -•  S  such  that^ 

. ,  ,,  t  t  t  .  .  t+1  t+1  t+lv 

iff(s,s,s;p,)s(s  ,  s  ,  s  ) 
a’  m’  p  *1  v  a  ’  m  ’  p  ' 

,  t+1  ,  t+1  r  i 

then  p.e  s  and  s  =  fp, 
i  m  p  irIJ 

(5)  g  is  an  output  function  of  the  mapping 


S  x  1  -  0 


such  that 


8(v  v  Vfi>”  {, 


0  if  p  e  sC 
i  m 


otherwise  (i.e.,  page  fault) 


(6)  initial  states  of  Sfl,  S^,  and  are  empty  sets  (cp),  i.e., 


S  =  S  =  S  s  CD. 
a  m  p  r 


It  should  be  noted  that  a  PRA  for  M  and  P  is  a  (deterministic) 

transition-assigned  finite-state  automaton  (FSA)  with  states  S  x  S  x  S 

“  “  ~  amp 

Inputs  P,  and  outputs  0,  as  shown  in  Figure  2-2.  The  triple,  (s  ,s  .s 

amp 


The  superscript  t  means  "at  time  t".  For  example,  s1"  means  that  the 

a 

PRA  control  state  at  time  t  is  s  . 

a 
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represents  .  state  of  this  FSA  and  is  called  a  conjuration  of  the 
PRA  system  E  [A1,K11.  As  the  system  E  receives  a  page  reference  string 
r  r  ...r  ,  the  configuration  of  this  system  changes  like 

<  V  ■ dp)  >  <  V  V  %>’■■■’  (V  V  alh  (sa+1-  sm+1<  dpt+1  > 

generating  an  output  sequence  of  0's  and  f's.  The  output  sequence, 
which  indicates  when  a  page  fault  takes  place,  depends  only  on  a  page 
reference  strfng,  the  PRA  in  use,  and  primary  memory  sUe  <m) .  This 
implies  that  once  a  PRA  is  chosen  for  a  computer  system  under  study 
with  a  fixed  amount  (m)  of  primary  memory,  the  probabilistic  property  of 
the  occurence  of  "1"  in  the  output  sequence,  i.e.,  the  occurrence  of 
page  faults,  is  completely  determined  by  the  probabilistic  property  of 
the  corresponding  input  sequence,  i.e.,  the  probabilistic  law  given  by 

the  P  matrix. 


2.2.2.  Behavior  of  the  PRA  System 

We  will  proceed  to  determine  the  PRA  control  states  left  unspecifi¬ 
ed  in  the  previous  section  and  enumerate  the  number  of  distinct  states 
that  the  PRA  system  E  can  assume  for  each  of  various  PRAs.  Then,  noting 
that  the  probability  associated  with  the  transition  from  a  certain 
current  state  to  another  is  completely  determined  by  that  pair  of  states 
and  the  P  matrix,  we  will  see  that  the  probabilistic  behavior  of  the 
system  E  can  be  viewed  as  a  Markov  chain  defined  over  these  states. 

First  of  all,  note  that  for  any  demand  paging  PRA 
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if  p  es 
i  in 


£<VVVP1>  -■  (sa’sn,+(pl)'tpl)>  i£  pl<V  lsJ<1" 

^s^sm+CpJ-{p.},{p  })  if  p  *s  ,  is  ,  =m  p  es 
ami  j  i  i  m  •  m'  3  r  j  m 

(2.2.2) 


where  p  is  the  page  replaced  by  page  p.  and  s'  is  a  state  which  is 
J  la 

generally  different  from  s^  Observing  that  the  PRA  is  exercised  only 

When  ^SnJ=m  and  that  the  Pro8ram  is  assumed  to  operate  indefinitely 

within  primary  memory  of  m  page-frames,  we  will  neglect  all  the  system 

states  for  which  Is  | <m  ;  we  will  now  enumerate  the  number  (N  )  of 

m  s 

distinct  states  that  the  system  T,  can  assume,  for  each  of  the  following 
several  PRAs,  assuming  that  |s  |=m. 

Random  PRA  The  page  to  be  replaced  is  chosen  randomly  from  the 

pages  currently  contained  in  primary  memory.  Therefore,  the  PRA  itself 

need  not  remember  any  kind  of  priorities  associated  with  resident  pages 

i.e.,  the  PRA  is  "memory less",  and  N  is  determined  by  the  number  of 

s 

different  combinations  of  only  s  and  s  .  That  is  to  sav 

pm 


N  (  Random  )  =  m  (  n  ) 
s  m 


(2.2.3) 


LRU  PRA  This  well-known  algorithm  chooses  the  least-recently- 

used  page  (i.e.,  the  page  with  the  longest  time  since  the  last  refer- 

i 

ence)  and  removes  it  from  primary  memory.  This  algorithm  must  maintain 
an  ordered  list  (called  a  LRU  stack  [Ml])  of  resident  pages  to  remember 
how  recently  each  resident  page  was  referenced  in  the  past.  Noting 
that  there  exist  m!  different  ordered  lists  and  that  each  ordered  list 
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automatically  determines  s  ,  N  is  obtained  as 

p  s 


N  (LRU  )  =  m!(  ) 

s  m 


FIFO  PRA 


(2.2.4) 

This  easy- to- implement  algorithm  chooses  the  page  which 


stayed  in  primary  memory  for  the  longest  time  for  removal.  This 
algorithm  must  maintain  an  ordered  list  of  resident  pages  to  remember 
in  which  order  each  resident  page  was  fetched  into  primary  memory. 
Noting  that  there  exist  m!  different  ordered  lists  and  that  cannot 
be  implied  by  each  of  those  lists  in  any  way,  N  is  obtained  as 


N  (FIFO)  =  m!  m  (  n  )  (2.2.5) 

s  m 

LET  PRA  This  algorithm  removes  the  page  with  the  longest  expected 

time  (LET)  until  next  reference  [Ml,Al,D2].  Because  the  expected  time 

until  next  reference  to  each  page  can  be  analytically  calculated  only 

from  s  and  the  P  matrix  (as  will  be  shown  below),  the  PRA  itself  is 
P 

memory less.  Therefore,  Ng  is  obtained  as 

N  (  LET  )  =  m  (  n  )  (2.2.6) 

s  m 

Another  similar  PRA  removes  the  page  with  the  longest  expected 
time  since  the  last  reference.  Because  this  PRA  uses  information 
about  the  backward  distance  (into  the  past)  from  the  faulted  page  to 
each  resident  page,  we  call  this  PRA  the  backward  LET  PRA  ( B -LET  )  . 


N  is  given  by  Eq. (2.2.6).  We  call  the  former  LET  PRA  the  forward 
LET  PRA  ( F -LET  )  .  It  should  be  noted  that  the  LRU  PRA,  an  approximation 
to  the  B-LET  PRA,  is  often  used  on  an  actual  computer  system  in  the 
hope  that  the  B-LET  PRA  is  fairly  close  to  the  F-LET  PRA  for  typical 
(actual)  programs. 
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Now  we  are  ready  to  consider  a  Markov  chain  model  of  the  behavior 
of  the  system  E  for  each  of  these  PRAs.  For  this  purpose,  we  borrow 
an  example  of  a  Markovian  program  which  was  used  by  Chang  [Cl]  in 
proving  that  the  F-LET  PRA  does  not  necessarily  minimize  the  number  of 
p'-ge  faults  for  a  general  Markovian  program.  Chang's  example1  is  re¬ 
produced  in  the  form  of  a  P  matrix. 


P 


t\+1  P1  P2  P3 


P1 

P2 

P3 


T  0  «  J-e 
d  0 

Vo  i  o  / 


(2.2.7) 


We  assume  that  our  primary  memory  has  only  two  page-frames,  i.e.,  M  = 

t£i’£2J- 


pra  Let  {®,  j  }  denote  a  state  of  E  where  s  =  fp  p  }  and 

m  1  *1’  J 

sp=  ^Pii-  Then,  there  exist  the  following  six  states  in  S,  for  this 
PRA. 


s  -  t(®,  2  J,  {  1,8  J,  f@,  3  J.  {2,®},  3  ),  {  1,®J  ] 

Noting  the  P  matrix  of  Eq.  (2.2.7)  and  the  random  nature  of  this  PRA, 
we  obtain  the  transition  matrix  3  specifying  the  probabilities  of  tran¬ 
sitions  among  these  states. 


This  example  happens  to  have  zero  diagonal  elements  in  the  P  matrix, 
but  probably  most  actual  programs  tend  to  have  fairly  large  diagonal 
elements  corresponding  to  locality  of  memory  references. 
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1 


Then,  there  exist  the  following  twelve  states: 


f©,2}  fl$}  f@,3}  C2,Q>}  C®,3} 

Cl,®} 

{@,2} 

0 

c 

0 

1-c 

2 

0 

¥\ 

£l,@} 

d 

0 

0 

1-d 

2 

0 

1-d 

2 

{@,3} 

d 

2 

0 

0 

1-d 

d 

2 

0 

ss 

{2,©} 

0 

0 

1 

0 

0 

(d),3} 

0 

c 

2 

c 

2 

0 

0 

1-c  1 

Ci,®} 

0 

1 

2 

1 

2 

0 

0 

0  / 

LRU  PRA 

Let  {©. 

j  }  similarly  denote  a 

state 

of  £  where  s  » 
m 

tpf  "j1 

and  sp  =  {  pj 

;  page  p^is  least  recently  used.  Then,  there 

exist  the 

same  six  states  in 

S  as 

above. 

and 

the  ^  matrix  is  obtained 

as 

I 

I 

I 

f@,2}  Cl,®}  {§,3}  C2,@}  C®,3}  {1,®} 

r 

C®,2} 

0 

c 

0 

0 

0 

1-c 

j 

Cl,®} 

d 

0 

0 

1-d 

0 

0 

{@,3} 

d 

0 

0 

1-d 

0 

0 

Q 

C2,@} 

0 

0 

1 

0 

0 

0 

CC13} 

0 

c 

0 

0 

0 

1-c 

Cl,®} 

o 

0 

1 

0 

0 

0 

I 

FIFO  PRA 

Let  C(5^}  <enote 

a  state 

of 

£  where 

S  S3  f 

m 

i  1 

Pi,Pjj,  .p- 

{  p£} ,  and 

page  Pj  stayed  in 

primary  memory  for  the 

longest 

time. 
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S 


[1,2  MM;],  (i,,®),  [1,1),  [1,3],  [2,  3,] 
(4,,®)  ,  [2,1),  [I,  3  ),  t4',i),  [i,Ci),  (  1,  ffi)l 


The  state  transitions  are  specified  by  the  following  §  matrix. 

t&2)  (©,&)  t&@)  til)  &  3)  [@l!  I&@)  (2l)  13}  [|ti)  Ci$>)  [ilJ 


[&2) 

1  0 

0 

c 

0 

0 

0 

1-c 

0 

0 

0 

0 

0  \ 

[®4) 

j  0 

0 

0 

c 

0 

0 

0 

0 

0 

0 

1-c 

0 1 
0 

da 

d 

0 

0 

0 

0 

0 

1-d 

0 

0 

0 

0 

fiH 

0 

d 

0 

0 

0 

0 

0 

0 

0 

0 

1-d 

0 

l&3) 

0 

0 

0 

0 

0 

0 

1-d 

0 

0 

d 

0 

0 

[@A 

0 

d 

0 

0 

0 

0 

0 

1-d 

0 

0 

0 

0 

d@i 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

UA)  | 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

H3)  1 

0 

0 

0 

0 

0 

c 

0 

0 

0 

0 

1-c 

0 

[©A  1 

0 

0 

c 

0 

0 

0 

0 

0 

0 

0 

0 

1-c  ' 

d©i 

1  0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

UA 

\  o 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

LET  PRA  First  of  all,  we  must  derive  the  expected  time,  E(t„), 

to  elapse  until  the  first  reference  to  a  page  p^  starting  from  a  page 
p^  (l^i,  j  2  n)  in  order  to  establish  the  decision  rule  of  this  PRA. 
Let  I\  be  the  matrix  obtained  from  P  by  setting  p ~ ^ jk  (Kronec^er's 
delta)  for  all  k,  i.e.,  making  page  p^  an  artificial  trapping  state 


[H2],  as  shown  below. 


P1  P2 
P1  /  P11  P12 

P2  I  P21  P22 


PJ  °  ° 


Pn  1  Pnl  Pn2 


pj 

•  •  •  P 

n 

plj 

Pln 

P2j 

P2n 

# 

• 

• 

1 

...  0 

■ 

• 

• 

; 

Pnj 

...  p  | 
nn  ] 

(2.2.8) 


Observing  that  the  j-th  column  of  (P^)  gives  the  probabilities  that 

page  p^  is  referenced  at  the  k-th  reference  starting  from  each  page  [H2], 

we  see  that  the  probabilities  that  page  p^  is  referenced  for  the  first 

time  at  the  k-th  reference  starting  from  each  page  is  given  by  the  j-th 

k  k™  I 

column  of  the  difference  matrix,  (P.)  -  (P  ).  (Note  that  page  p  is 

..J  J 

the  artificial  trapping  state.)  Therefore, 


I  k{<Pt^-(p1)k'1}  = 


-y  -i 


where  (P.)  =  ( 0 ) . 


?1  ...  Pj  ...  Pn 

...  E( t j . )  ...  | 

p2  I  ...  E(c2j)  •••  . 


pn  l  *•*  E(tnj)  ••• 


(2.2.9) 
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Repeating  a  simlilar  calculation  for  each  j  (lfjsn),  we  can  obtain  the 
matrix  E(t)  which  contains  E^j)  as  its  i-j  element.  ' 


E(t) 


/  1  E(v  •••  "<',»> 

[  E(t21)  1  ...  E(t2n) 

•  •  • 

•  •  • 

<  »  « 

i  E(tnl)  E(tn2)  ' **  1 


(2.2.10) 


We  call  E(t)  the  distance  matrix  and  E^)  the  distance  from  a  page  Pi 

to  a  page  Pj.  It  should  be  noted  that  this  matrix  is  not  necessarily 
symmetric. 

Now  we  can  describe  the  LET  PRAs  more  clearly  using  the  concept  of 
distances.  Suppose  that  a  page  Pi  was  found  to  be  missing  in  primary 
memory  when  I  sm|=m  in  the  course  of  program  execution.  Then, 

f(vvypi>  -  (yy^iMPjUPi}) 

where  -f  E(t  )  =  max  E(t  )  if  F-LET  is  used, 

Pk£Sm 

E(t  )  =  max  E(t,  )  if  B-LET  is  used,  (2.2.11) 

PkGSm 

V  sa  =  sa  =  'p  (memoryless  algorithm). 

This  means  that  the  F-LET  and  the  B-LET  PRAs  coincide  if  the  ordering  of 
row  elements  of  the  distance  matrix,  with  respect  to  the  magnitude  of 
their  values,  is  the  same  as  that  of  the  corresponding  column  elements 
(or  more  strongly,  if  the  distance  matrix  is  symmetric).  For  this 
reason,  the  LRU  PRA  presumably  performs  well  if  the  distance  matrix  of 
real  programs  tends  to  be  symmetric. 


63 


For  the  P  matrix  of  Eq.  (2.2.7),  with  c  =  0.1  and  d-0.5,  we  obtain 
the  following  distance  matrix. 

1  1.9  1.16^ 

E(t)  =31  1.58 

4  1  1  / 


Therefore,  as  is  clear  from  Eq.  (2.2.11),  the  decision  rule  of  the 

F-LET  PRA  can  be  stated  as  follows: 

r  .  ,  t+1 

SL  page  to  be  removed 

{P1iP2^  p3  P1  fp2,p3^ 

{P2,P3}  pl  p2  tpPp3^ 

f P^jP3i  p2  P1  ^  P2,P3^ 


Similarly,  the  decision  rule  of  the  B-LET  PRA  can  be  stated  as  follows: 

g1-  r  page  to  be  removed  s*"+ 

m  t  m 

{p!,p2}  p3  p2  fpPp3^ 

{P2,P3}  P1  P3  (PPP2} 

{ppP3}  p2  P1  ^  P2,P3^ 


Now,  let  {&,  j  }  denote  a  state  of  Z  where  sm=  f  Pp  P^  and  sp=  {Pi}. 
Then,  for  each  of  LET  PRAs,  we  have  the  same  set  cf  six  states  as 
obtained  for  the  Random  PRA.  ihe  transition  matrix  for  the  F-LET  PRA 


is  given  by 
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f&2}  {1,0}  {gl3}  {2 Si  {0,3} 
ld>, 2}  0  c  0  1-c  0  0 

fi>0}  d  o  o  i-d  o  o 

q  —  [@,3}  0  0  0  1-d  d  0 

[2,®}  0  0  1  0  0  0 

f&3}  0  0  c  0  0  1-c  /  c=0. 1 

0  0  1  0  0  O'  d=0. 5 

The  corresponding  matrix  for  tha  B-LET  PRA  is  given  by 

{©,2]  U,®)  (®,3j  [2,@J  (1,®) 

0  c  0  0  0  1-c 

d  0  0  0  0  1-d 

d  0  0  1-d  0  0 

0  0  1  0  0  0 

0  0  c  0  0  1-c  c=0. 1 

0  °  1  0  0  0  /  d=0. 5 

Thus,  we  have  obtained  the  transition  matrix  £  which  characterizes 
the  Markovian  behavior  of  the  PRA  system  E  for  each  PRA. 


£0,2} 

U,0} 

=  (0,3} 

[2,0} 
{d),3} 
£1,0} 
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2.2.3.  Evaluation  of  Paging  Behavior  of  Markovian  Programs 

Now  chat  we  have  enumerated  the  number  of  system  states,  each  of 
which  satisfies  the  Markovian  property,  and  have  shown  how  the  state 
transitions  can  be  determined  when  a  PRA  is  chosen  for  a  given  program, 
the  next  natural  step  is  to  evaluate  the  program  behavior  in  this 
environment.  Therefore,  this  section  is  concerned  with  how  often  a 
page  fault  occurs  as  a  given  Markovian  program  is  executed  in  a  given 
amount  of  primary  memory  under  the  dynamic  memory  managemant  of  a  given 
paging  algorithm. 

We  will  use  the  missing-page  probability  p  defined  below  to  measure 
the  frequency  of  page  faults. 

expected  number  of  page  faults  observed  during  the  period  [0,  tj 

p  m  lijn  . . — . . . — . - . —  't  . .  — 

(2.2.12) 

It  should  be  noted  that  the  missing-page  probability  is  the  inverse  of 

mhbpf,  as  is  clear  from  this  definition. 

As  a  Markovian  program  is  executed  under  a  given  PRA.  the  PRA 

system  E  operates  over  a  set  of  N  Markovian  system  states  making  two 

8 

kinds  of  state  transitions,  i.e.,  those  each  accompanied  by  a  page 
fault  and  those  not  accompanied  by  a  page  fault.  We  call  them  page- 
fault  transitions  and  non-page- fault  transitions  respectively.  Then, 


the  missing-page  probability  can  be  expressed  as 
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P  =  11m  Prob [occurrence  of  a  page-fault  transition  at  time  tj 

r3  /  ,'S(  SUI"  °f  the  a11  condltional  probabilities 
=  ?.  of  page-fault  transitions  from  state  k, 

-  V.  \  given  that  the  system  is  now  in  state  k 

(!.*♦!*) 

where  n(k)  is  the  steady-state  probability  of  the  k-th  state  of  the 
system  E  .  The  value  of  jt(k)  (k-1, 2,  . . . ,  N.)  can  be  determined  by  solv¬ 
ing  the  balance  equations  (H2] 

-  =  -S  (2.2.14) 

where  n  is  a  row  vector  whose  elements  are  n(k)  (k=l,2,..,  N  )  Thus 

'  ’  ’  s  ’ 

we  have  presented  a  method  to  evaluate  the  missing-page  probability 
(or  mhbpf )  of  a  given  Markovian  program  operating  in  a  given  amount  of 
primary  memory  under  the  dynamic  memory  management  of  a  given  demand 
paging  algorithm. 

Next,  we  will  give  some  numerical  results  using  the  example  of  the 
previous  section.  Let  us  first  evaluate  the  performance  of  the  three 
page  program,  operating  in  the  two  page- frame  primary  memory  under  the 
LRU  PRA,  that  we  discussed  in  Section  2.2.2.  Solving  Eq.  (2.2.14)  with 
the  matrix  (c  =  0.1,  d  =  0.5)  obtained  before,  we  get 

£  =  (^«I,2),n(l,O),n(0,3),jr(2,O),n«D,3),n(l,Q))) 

=  (0.204,  0.020,  0.388,  0.204,  0  ,  0.184  ) 

Then,  we  can  calculate  the  missing-page  probability,  using  Eq.  (2.2.13), 
as  follows: 
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p  *  0.204  (  1-c  )  +0.020  (  1-d  )  +0.388  p  +0.184 
=  0.184+0.010+0.194  +  0.184 
=  0.572 

The  missing-page  probabilities  were  similarly  evaluated  for  other 
PRAs  considered  in  the  previous  sections.  The  result  is  summarized  in 
Table  2-1.  It  should  be  pointed  out  that  the  result  of  Table  2-1 
depends  heavily  on  a  particular  choice  of  the  P  matrix.  However,  it  is 
reasonable  to  expect  that  the  B-LET  and  the  LRU  PRAs,  which  use  the 
"backward1'  information  about  the  program  behavior,  generally  do  not 
perform  well  for  a  Markovian  program  whose  P  matrix  is  rather  asymmet¬ 
ric;  in  fact,  their  performance  shown  in  Table  2-1  is  significantly 
worse  than  that  of  others.  Similarly,  the  FIFO  PRA  does  not  perform 
well.  On  the  other  hand,  the  F-LET  PRA,  which  uses  the  "forward" 
information  about  the  program  behavior,  outperformed  other  PRAs.  It 
is  however  known  that  the  F-LET  PRA  does  not  necessarily  minimize  the 
missing-page  probability  of  a  first-order  Markovian  program.  Chang  [Cl] 
actually  gave  a  better  PRA,  for  this  Markovian  program,  which  has  the 
same  decision  rule  as  that  of  the  F-LET  PRA  described  in  Section  2.2.2. 
except  that  if  a  page  fault  occurs  when  s^s  {  P2>P^J  then  page  p^  is 
removed.  For  this  PRA,  the  missing-page  probability  is  found  to  be 
0.388  by  the  above  evaluation  method. 

Thus,  it  has  been  shown  that  the  paging  behavior  of  any  Markovian 
program  can  be  analytically  evaluated  as  a  function  of  (1)  program 
characteristics,  (2)  the  paging  algorithm,  and  (3)  primary  memory  size. 
However,  the  method  which  has  been  described  in  Section  2.2.  has  a 
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Table  2-1  Comparison  of  PRA  Performances 


PRA 

Missing-Page  Probability 

Random 

0.475 

LRU 

0.572 

FIFO 

0.565 

F-LET 

0.408 

B-LET 

0.673 
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serious  drawback  concerning  the  amount  of  required  computation.  To 
examine  this  problem,  let  us  assume  that  the  computer  system  uses  a 
memoryless  PRA  (such  as  the  Random  or  the  LET  PRA) .  The  number  of 
system  states  was  found  to  be 

„  /  n  s  n ! 

N  =  m  (  )  =  - — 777- - 77- 

s  m  (m-1)  !  (n-m) ! 


and  we  must  solve  the  simultaneous  equations  in  Ng  variables,  given  by 
Eq.  (2.2.14),  in  evaluating  the  missing-page  probability.  Although 
sparsity  of  non-zero  elements  in  a  §  matrix  may  help,  solving  such  simul¬ 
taneous  equations  in,  say,  three  hundred  variables  requires  much  compu¬ 
tation  time.  For  example,  if  we  have  n=10  and  m=8,  then  we  get  Ns=360. 
If  we  have  a  pair  of  more  reasonable  numbers,  like  n=50  and  m=40,  for  a 

typical  situation  of  the  Multics  system,  then  N  becomes  a  tremendously 

s 

large  number.  Therefore,  if  we  want  to  study  the  behavior  of  more 

realistic  programs,  it  seems  that  we  must  somehow  simplify  our  model. 

King  [K1  ]  used  a  simplified  model,  for  which  P^j=  Pj  for  all  i  in 

the  P  matrix,  in  specifying  a  program's  reference  pattern.  This  means 

that  the  page  to  be  referenced  at  time  t+1  no  longer  depends  on  the 

page  being  referenced  at  time  t.  Therefore,  the  program  behavior  is 

completely  memoryless  and  the  third  component  of  a  state  of  the  PRA 

system  Z  ,  s^,  can  be  reduced  to  an  empty  set,  i.e.,  s^  cp  .  This  type 

of  program  behavior  model  is  called  a  zeroth-order  Markovian  program 

(or  a  multinomial  process  model).  The  number  of  system  states  de- 

s 


creases  in  this  simplified  model  by  a  factor  of  m,  but  the  evaluation 
of  this  method  still  requires  a  large  amount  of  computation. 
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2.3.  Zeroth-order  Markov  Model  Applied  to  the  Page  Size  Problem 

In  Section  2 . 3,  we  will  consider  the  effect  of  another  system 
parameter,  i.e.,  the  page  size,  upon  the  paging  behavio;:  of  programs, 
because  the  page  size  must  be  changed  to  examine  its  effect,  we  must  be 
concerned  with  a  program's  reference  pattern  not  only  over  its  pages  but 
within  each  of  those  pages.  We  must  be  concerned  also  with  how  a  compiler 
forms  a  program's  pages  by  combining  smaller  pieces  of  that  program,  i.e., 
the  pagination  process  of  the  compiler.  The  effect  of  page  size  upon  a 
program' 3  paging  behavior  has  been  experimentally  studied  by  Hatfield  et 
al  [HI],  Informatics,  Inc.  [II],  and  Baer  et  al  [Bl],  This  aspect  of 
the  page  size  problem  ,  i.e.,  this  aspect  of  the  problem  of  determining 
the  optimum  page  size,  has  rarely  been  studied  in  an  analytical  approach. 
So  far  as  known  to  the  author  of  this  thesis,  only  Woolf  [W3]  has  examined 
the  effect  of  page  size  upon  the  frequency  of  paging  analytically.  The 
approach  to  be  taken  in  this  section  is  completely  different  from  that  of 
Woolf  because  of  the  way  in  which  programs  are  modeled.  Other  aspects 
of  the  page  size  problem  will  be  discussed  later  in  Chapter  5. 


2.3.1.  Proi^am  Behavior  Model  for  the  Page  Size  Studies 

A  program  usually  possesses  a  fundamental  property  known  as  locality 
of  information  references  [Dl-3],  i.e.,  a  program's  tendency  to  reference 
a  subspace  of  its  address  space  during  any  interval  of  its  execution. 

We  call  a  set  of  pages  in  such  a  favored  subspace  of  its  address  space 
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a  set  of  favored  pages  or  a  favored  set1  in  this  section.  Control  of  a 
program  usually  stays  in  a  favored  set  for  some  time  and  then  enters 
another  favored  set.  Therefore,  we  can  evaluate  the  program  behavior 
by  analyzing  the  behavior  of  each  successive  favored  set  using  a  Markov 
model  described  in  Section  2.2. 

If  we  take  a  close  look  at  each  page  of  a  favored  set,  we  find 
that  there  are  portions  which  are  referenced  frequently  (e.g.,  loops) 
and  protions  which  are  not  (e.g.,  program  code  following  a  loop,  rarely 
used  data,  etc.)  We  roughly  call  each  of  these  logically  identifiable 
portions  a  program  block.  Thus,  each  page  of  a  favored  set  is  composed 
of  (several)  program  blocks  combined  together  by  a  compiler. 

Now  we  are  concerned  with  a  program's  reference  pattern  on  the 
level  of  program  blocks.  In  view  of  the  computational  drawback  of 
first-order  Markovian  programs  discussed  in  Section  2.2.3,  we  use  a 
zeroth-order  Markovian  program  model  for  the  behavior  of  a  favored  set. 
Let  b  (j=l,2, . . . )  and  p(b  )  be  the  j-th  program  block  and  its  probabil- 
ity  reference  in  the  favored  set  respectively.  For  the  sake  of  sim- 
assume  that  all  the  program  blocks  have  the  same  size,  s(b), 
and  that  only  one  page  size,  s(p),  is  used  on  the  computer  system. 
Furthermore,  we  assume  that  the  block  size  s(b)  divides  the  page  size 
s(p).  Denoting  the  i-th  page  of  the  favored  set  and  its  probability 

1Roughly  speaking,  a  "favored  set"  corresponds  to  a  program's  module 
being  referenced  by  a  process.  This  loosely  defined  concept  should  be 
distinguished  from  the  more  strictly  defined  "working  set"  of  Denning 
[Dl, D3] . 
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of  reference  by  p^(i=l, 2, . . . , n)  and  p(p^)  respectively,  we  have 

p(p.)  =  2  P (b . )  (2.3.1) 

b.  e  p.  J 
J  i 

n  n 

Ep(p,)  =  r  r  p(b  )  =  1  (2.3.2) 

i=l  i=lb.ep.  J 

J  i 

Because  these  program  blocks  generally  have  different  probabilities 
of  reference,  the  program  pages  of  the  favored  set  have  different 
probabilities  of  reference.  One  of  the  very  interesting  problems  in 
this  area  of  study  is  that  of  determing  a  set  of  program  blocks  to  be 
contained  in  each  page  of  the  favored  set  so  as  to  minimize  the  expected 
number  of  page  faults  during  its  execution.  This  problem  is  called  the 
pagination  problem.  The  paging  behavior  of  program  is  known  to  depend 
heavily  on  the  quality  of  pagination  [Hi]. 


2.3.2.  Pagination.  Page  Size,  and  Missing-Page  Probability 

We  proceed  to  evaluate  the  performance  of  programs  under  paging  in 
terms  of  missing-page  probability,  for  different  paginations  and  differ¬ 
ent  page  sizes.  It  is  known  that  the  F-LET  PRA  is  optimum  for  zeroth- 
order  Markovian  programs  in  the  sense  that  it  minimizes  the  expected 
number  of  page  faults  [A1 ] .  Therefore,  we  assume  that  this  PRA  is  in 
use  for  the  computer  system  under  study. 

We  now  consider  three  different  paginations  for  the  program  (the 
favored  se  )  described  in  Section  2.3.1,  i.e.,  a  zeroth-order  Markovian 
program.  Without  loss  of  generality,  we  can  assume  that  b.  and  p^  are 
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both  numbered  in  the  order  of  monotonically  decreasing  probability; 
that  is  to  say. 


p(bL)  s  p(b2)  >  p(b3)  S;  ...  ^  p(bk) 
P(Pi)  *  P(P?)  -  p(p-)  *  ...  £  p(p  ) 


(2.3.3) 

(2.3.4) 


Best  Paginai.ion  The  best  pagination  must  have  the  property  that  the 
missing-page  probability  p  is  minimum  for  any  size  m  (OSmSn  pages)  of 
primary  memory  available  to  the  program.  Noting  that  the  F-LET  PRA 
retains  m-1  most-frequently-used  pages  and  the  page  which  caused  a  page 
fault  for  a  zeroth-order  Markovian  program,  we  get 


P  =“  i-  Z  P(P.) 
i=l  1 


(2.3.5) 


Let  r  be  the  number  of  program  blocks  that  can  be  contained  in  a  page, 

i.e.,  r=s(p)/s(b).  Then,  the  best  pagination  is  the  one  for  which  the 

most-frequently-used  page  p^  contains  the  r  most-frequently-used  program 

blocks  b^,b2, . . .,br,  the  second  most -froquently-used  page  p2  contains 

the  second  r  most-frequently-used  program  blocks  b  .  b  ...  b 

r+1’  r+2  2r’ 

and  so  on.  In  other  words, 

Pi  =  ^  b(i-l)r+l>  b ( i - 1 ) r+2 ’  ***  J  bir  ^  1*lSn  (2.3.6) 


It  is  easy  to  prove  that  this  pagination  has  the  property  that  the 
missing-page  probability  is  minimum  for  any  m,  under  the  F-LET  PRA. 
Furthermore,  page  references  are  best  localized  by  this  pagination. 

Worst  Pagination  It  seems  that  any  one  pagination  generally  cannot 
guarantee  that  the  missing-page  probability  given  by  Eq.  (2.3.5)  is 
maximum  for  every  size  m  of  primary  memory.  To  see  this,  assume  that 
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there  exists  a  worst  pagination  with  page  reference  probabilities  p*(p  ) 
(i=l , . . . , n) .  Then,  for  an  arbitrarily  chosen  pagination  with  page 
reference  probabilities  p(p^)  (i=l , 2, . . . , n) ,  the  following  inequalities 
must  be  simultaneously  satisfied; 

P*(Pj)  s  P(Pj)  for  m=l 

P*(PX)  +  P*(P2)  £  p(Pl)  +  p(p2)  for  m=2 

k  :  k  : 

z  P*(pt)  ^  E  p(p  )  for  m=k 

i=l  1  *  i=l  1 

n  :  n  :  | 

^  P*(p^)  ^  E  p(p.)  for  m=n  / 

i=l  i=l  1 


(2.3.7) 


where  p*(p^)  and  p(p^)  are  both  numbered  in  the  non- increasing  order,  as 
given  by  Eq.  (2.3.4).  The  first  inequality  of  Eq.  (2.3.7)  means  that 
the  largest  page  reference  probability  is  minimized  by  the  worst  pagina¬ 
tion.  There  generally  exists  just  one  pagination  which  satisfies  this 
inequality,  but  tnis  pagination  does  not  guarantee  the  rest  of  the  in¬ 
equalities  of  Eq .  (2.3.7),  in  general.  Therefore,  we  must  give  up 
searching  for  the  "universal"  worst  pagination  that  is  independent  of  m. 
Instead,  we  say  that  a  given  pagination  is  near-worst  if  the  resulting 
page  reference  pattern  is  reasonably  close  to  a  uniform  distribution. 

In  fact,  it  can  be  shown  that  as  r  =  s(p)/s(b)  approaches  infinity  the 


missing-page  probability  of  such  a  near-worst  pagination  for  a  given 
size  (m)  of  primary  memory  becomes 


P  =  1  -  Ep(P.)  =  —  =  1  -  2 

i-r  1  n  n 


(2.3.8) 


The  following  simple  algorithm  seems  to  be  a  practical  one  to  derive  a 
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near-worst  pagination. 

Algorithm  to  derive  a  near-worst  pagination  Assume  that  the  reference 
probabilities  of  program  blocks  of  a  given  program  (a  favored  set)  are 
arranged  in  the  non- increasing  order  and  that  we  want  to  derive  a  near- 
worst  pagination  for  page  size  2s(b).  The  derivation  algorithm  consists 
of  successive  mergers  of  program  blocks;  merge  the  most- and  the  least- 
frequently-used  blocks  (b^  and  b  ),  merge  the  second  most- and  the  second 
least-frequently-used  blocks  (b^  and  b^  p,  merge  the  third  most-. and 
the  third  least-frequently-used  blocks,  and  so  on,  in  order  to  form  the 
pages  which  tend  to  have  the  same  magnitude  of  reference  probabilities. 

If  the  number  (nr)  of  the  program  blocks  is  not  even,  an  appropriate 
modification  is  necessary  (e.g.,  adding  a  dummy  block).  If  we  want  a 
near-worst  pagination  for  page  size  4s(b)  ,  then  arrange  the  above 
newly  created  pages  of  size  2  s(b)  in  the  order  of  non- increasing  proba¬ 
bility  and  repeat  the  above  pair-wise  successive  mergers. 

Random  Pagination  The  program  blocks  are  randomly  selected  and  merged 
into  pages.  The  missing-page  probability  of  a  random  pagination  is 
bounded  by  those  of  the  best  and  the  worst  paginations.  Perhaps,  a 
random  pagination  is  fairly  close  to  a  casual  pagination  found  in  most 
actual  user  programs.  The  analytical  evaluation  of  the  missing-page 
probability  of  this  pagination  for  a  given  program  is  not  as  easy  as  it 
seems,  but  the  performance  of  this  pagination  could  be  easily  evaluated 
by  a  similation.  (We  will  not  consider  this  pagination  any  further  in 


this  thesis.) 
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Next,  we  present  an  illustrative  example  of  the  program  behavior 
under  paging,  to  show 

•  the  effect  of  pagination  (due  to  a  compiler's  code  optimization, 
loading  order  of  program  modules,  programming  style,  etc.) 

*  the  effect  of  page  size 

upon  the  missing-page  probability  (the  inverse  of  mhbpf)  as  a  function 
of  the  size  of  available  primary  memory. 

Example  We  must  specify  the  reference  pattern  of  a  program  to  be 
studied.  Let  us  assume  that  s(b)  is  256  words  and  that  the  distribution 
of  p(b.)  is  given  by  Figure  2-3.  Then,  we  can  numerically  evaluate  the 
lower  and  the  upper  bounds  of  the  missing-page  probability  of  an  arbi¬ 
trary  pagination  for  a  given  page  size,  by  examining  the  performance  of 
the  best  anu  the  worst  paginations. 

Let  us  consider  the  case  of  s(p)=512  T  >rds,  first  of  all.  Then,  we 
obtain,  for  the  best  pagination, 

p(p1)=l/16+l/16,  p(p?)=l/16+l/16,  p(p3)=l/32+l/32 
.  ,  p(p30)=l/128+l/128 


and  for  a  near-worst  pagination, 

P (Pl) =1/16+1/128,  p(p2) =1/16+1/128,  p(p3)=l/i6+l/128 
.  ,  P(P30)=1/128+1/128 


Noting  that  memory  management  is  under  the  demand  paging  F-LET  PRA.  we 
obtain  the  result  shown  in  Figure  2-4.  It  is  seen  that  the  region  bounded 


istribution 


p  missing-page 


Figure  2-4  Effect  of  Pagination  on  Missing-Page  Probability 


79 


by  the  two  curves1,  which  represents  the  variation  of  program  periorm- 
ance  due  to  pagination,  is  rather  narrow  for  this  choice  of  page  s^ze, 
i.e.,  s(p)=512  words.  Similarly,  the  bounding  curves  were  derived  for 
each  choice  of  page  size  (256,  512,  1024,  and  2048  words)  using  the 
algorithms  described  above,  in  order  to  examine  the  effect  of  page  size 
upon  the  missing-page  probability  p.  This  result  is  shown  in  Figure  2-5. 
It  is  clear  that  the  bounded  region  becomes  wider,  having  a  common  lower 
bound  -urve,  as  the  page  size  increases.  It  should  be  noted  that  the 
upper  bound  curve  for  s(p)=2048  words  is  ve-y  close  to  the  straight 
line  given  by  Eq .  (2.3.8).  In  summary,  the  following  observations  may 

be  made: 

(1)  If  a  given  program  is  optimally  paginated  (like  the  best 
pagination),  a  choice  of  page  size  does  not  affect  the  missing- 
page  probability  (or  mhbpf). 

(2)  The  choice  of  smaller  page  size  tends  to  attain  the  smaller 
missing-page  probability  (or  the  longer  mhbpf)  if  a  given 
program  is  casually  paginated. 

We  assumed  in  the  above  analysis  that  program  blocks  are  relocatable 
within  a  program's  address  space.  However,  it  is  apparent  that  a 
program  will  behave  as  poorly  as  the  one  with  a  near-worst  pagination 


1The  bound  given  by  a  near-worst  pagination  is  not  exactly  an  upper 
bound  because  it  is  not  the  worst  possible  pagination  for  each  memory 
size  m,  but  it  is  believed  to  be  fairly  close  to  the  tightest  upper 
bound/  For  this  reason,  it  is  treated  as  if  it  is  the  exact  upper  bound. 


p  missing-page 
l  probability 


s (pi *2048  words 


1024  words 


512  words 


near-worst 


paginations 


256  words 


^  ,  111 

,\  y'-js 

Ny^ 


best  paginations 

(256,  512,  1024,  2048  words) 


k  words 

size  of  available 
primary  memory 

Figure  2-5  Effect  of  Page  Size  on  Missing-Page  Probability 


81 


even  If  the  program  blocks  are  not  relocatable,  if  that  program  happe-s 
to  possess  such  a  bad  loading  order  of  machine  instructions  as  the  one 
generated  by  the  near-worst  pagination.  Therefore,  program  behavior 
like  that  seen  in  this  section  should  be  universally  found  on  any  virtua 


memory  computer  system. 

Finally,  some  comments  may  be  in  order  about  the  drawbacks  of  tne 
model  which  has  been  used  in  Section  2.3.  First  of  all.  it  should  be 
remembered  that  the  model  is  based  on  the  stationary  zeroth-order 
Markov  property  of  program  behavior.  This  is  perhaps  the  least  accurate 
assumption  of  this  model.  Second,  the  model  assumes  a  single  program 
block  size  which  divides  the  page  size.  In  order  to  be  more  realistic, 
we  must  consider  the  probability  distribution  of  block  size  and  page¬ 
boundary  crossover  of  program  blocks.  Third,  we  have  not  derived  the 
exact  performance  of  actual  (casual)  paginations.  The  lower  bound  of 
the  missing-page  probability  given  by  the  best  pagination  may  be  loose 
if  a  non-opt imal  PRA  (like  the  LRU  PRA)  is  in  use.  However,  in  spite  of 
these  drawbacks  of  this  simplified  model,  it  has  successfully  demonstrated 


the  basic  relationship  between  pagination,  page  size,  and  the  stead'- 
state  paging  behavior  of  programs.  While  the  smaller  page  size  was 
found  to  yield  a  longer  mhbpf  in  the  steady-state  program  behavior,  it  is 
clear  that  the  larger  page  size  yields  a  longer  mhbpf  in  it  transient 
state  behavior  (e.g..  when  the  primarv  memory  is  not  vet  filled  with 
pages).  This  aspect  will  be  further  considered  in  the  next  section  and 


,r  in  Chapter  5  when  the  page  size  problem  will  he  discussed  from 


overall  viewpoint. 
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2.4.  Random  Behavior  Model  Applied  to  the  Memory  Size  Problem 

In  Section  2.4.,  we  will  consider  a  simplified  zeroth-order  Markovian 
program,  called  a  random  behavior  model,  to  study  primal  ly  the  effect  of 
primary  memory  size  upon  its  paging  behavior.  We  will  consider  both  the 
transient -state  program  behavior,  i.e.,  the  initial  stage  of  program 
execution  generating  a  number  of  page  faults,  and  the  steady-state 
program  behavior,  i.e.,  the  following  stage  of  program  execution  where 
the  rate  of  page  faults  is  smaller  because  available  primary  memory  space 
is  fully  utilized.  It  has  been  experimentally  observed  [B5,Hl,T2J  that 
paging  behavior  of  programs  is  more  sensitive  to  the  available  primary 
memory  sLe- than  to  any  other  parameters  (e.g.,  a  page  replacement  algorithm), 
while  primary  memory  is  usually  the  most  expensive  element  of  a  computer 
system.  Therefore,  an  evaluation  of  program  behavior  within  a  limited 
amount  of  primary  memory  is  very  important  from  a  cost-performance  view¬ 
point.  We  will  express  the  mhbpf  of  a  random  behavior  model  of  programs, 
as  a  function  of  program  size,  primary  memory  size,  and  a  parameter 
which  determines  the  total  length  of  program  execution  time.  It  will  be 
clearly  seen  how  thrashing  (i.e.,  excessive  competition  for  primary  memory 
page-frames  leading  to  a  less  than  optimal  use  of  system  resources)  starts 
to  occur  as  the  primary  memory  size  is  gradually  reduced. 


2.4.1.  Random  Behavior  Model 

We  will  again  assume  partitioned  primary  memory  under  the  management 
of  a  demand-paging  local  page  replacement  algorithm,  as  in  Sections  2.2 
and  2.3.  A  general  zeroth-order  Markovian  program,  used  in  Section  2.3, 
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is  further  simplified  by  requiring  that  all  pages  (except  a  special 
page)  be  equally  probable  in  its  page  reference  pattern. 

This  program  behavior  model  is  assumed  to  have,  besides  ordinary 
pages,  two  special  pages,  i.e.,  a  starting  page  which  contains  an  entry 
point  of  the  program  and  an  ending  page  which  contains  an  exit  point  of 
the  program;  it  is  assumed  that  program  execution  begins  from  the  start¬ 
ing  page  px  and  ends  immediately  upon  entrance  to  the  ending  page  pn 
where  n  is  the  size  of  the  program  under  study.  Transfer  of  processor 
control  among  n  pages  may  occur  only  at  the  multiples  of  a  unit  time  tQ, 

i.e.,  t  =  kt  where  k  =  1,2,3, -  It  is  furthermore  assumed  that  the 

ending  page  can  be  referenced  with  a  constant  probability  e  at  any  stage 
of  program  execution.  Therefore,  the  page  reference  pattern  of  this 
program  behavior  model,  which  we  call  a  random  behavior  mqd_el  or  a  random 
program  hereafter,  is  characterized  by  the  following  transition  matrix. 
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In  other  words, 
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Ptj  =  P(Pj)  *  i^n,  j^n 

Pin  =  P(Pn>  =  e 

pnj  “  0 


(2.4.2) 


Thus,  a  random  behavior  model  has  three  independent  parameters,  i.e., 
program  size  (n),  the  reference  probability  (e)  of  the  ending  page,  and 
the  unit  time  (tg)  of  this  discrete-time  Markovian  model.  The  program 
being  modeled  references  the  starting  page,  then  some  number  of  pages  at 
random,  and  finally  the  ending  page.  The  number  of  pages  referenced 
during  the  execution  of  this  program  is  a  random  variable.  This  may  be 
interpreted  in  such  a  way  that  the  number  of  pages  to  be  referenced  during 
a  program  execution  depends  on  the  arguments  given  to  that  program,  but 
a  random  program  represents  nevertheless  a  very  special  class  of  programs. 
The  essense  of  this  artificial  program  consists  in  its  capability  to 
yield  some  insights  into  the  memory  size  problem,  i.e.,  the  problem  of 
evaluating  the  effect  of  primary  memory  size  upon  the  dynamic  paging 
behavior  of  programs.  Because  of  the  uniformness  of  its  page  reference 
characteristics,  the  paging  performance  of  a  random  program  does  not 
depend  on  a  particular  demand-paging  page  replacement  algorithm  in  use. 


2.4.2.  Evaluation  of  Paging  Behavior  of  Random  Programs 

We  will  analytically  evaluate  the  mhbpf  of  a  random  program 
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operating  in  a  limited  amount  of  primary  memory  under  demand  paging,  as 

an  explicit  function  of  the  size  (m)  of  primary  memory  and  the  three 

independent  parameters  (n,e,and  t^)  of  a  random  program,  in  this  section. 

First  of  all,  we  should  briefly  consider  mhbpf  because  there  seem 

to  exist  at  least  two  reasonable  definitions  of  mhbpf  for  our  problem. 

Suppose  that  we  make  K  experiments  to  run  a  random  program.  Let  Te(Tei) 

and  n  (n  )  be  the  execution  time  of  the  random  program  and  the  number 
P  Pi 

of  page  faults  generated  by  this  program,  in  the  (i-th)  experiment. 

Then,  the  following  two  definitions  of  mhbpf  seem  to  be  reasonable: 


mhbpf ^ 


mhbpf2 


K  T  T 

E  (-—V K  =  E  (— £-) 
1=1  pi  P 
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E  T 

i  S3  1 


el 


E  (Te) 

K  -  E  (n  ) 

E  nn( 
i=l  Pl 


(2.4.3) 


In  deriving  mhbpf,  all  experiments  are  equally  weighted  in  the  former 
definition,  while  all  page  faults  are  equally  weighted  in  the  latter 
definition.  Because  we  usually  use  the  latter  definition  in  calculating 
mhbpf  from  on-line  monitoring  results  of  an  actual  computer  system,  we 
use  the  latter  definition  of  mhbpf  in  evaluating  the  paging  performance 
of  a  random  program  in  this  section. 

Now  we  proceed  to  derive  the  mhbpf  of  a  random  program,  characteriz¬ 
ed  by  n,  e,  and  tQ,  operating  in  primary  memory  of  m  page-frames.  Let 
Si  and  Sl  ( i=0. 1 , 2, . . . )  respectively  denote  a  state  of  the  random  program 
where  it  is  still  running  after  causing  exactly  i  page  faults  and  a  stare 
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of  the  random  program  where  it  has  referenced  the  ending  page  after 
causing  exactly  i  page  faults.  There  exist  o-ly  three  possible  state 
transitions  from  i.e.,  a  transition  to  itself,  a  transition  to  S.+1, 
and  a  transiton  to  Si+r  Now  we  want  to  derive  the  probabilities  associ 

•ted  with  these  three  state  transit  ions , i . e . ,  p  D  - 

Hi,i  '  Pi,  i+1 J  3nd  Pi,  i+r 

Because  each  page  of  the  program  is  assumed  to  be  equally  probable  in  a 
page  reference  string  at  any  stage  of  the  program  execution,  all  that 
matters  is  the  number  of  pages  which  exist  in  primary  memory.  If  we 
assume  that  no  page  of  the  program  was  initially  in  primary  memory,  then 
there  must  be  min(i,m)  pages  in  primary  memory  when  the  program  is  in 
state  Sr  Therefore,  noting  Eq.  (2.4.1)  or  Eq.  (2.4.2),  the  probabil¬ 
ities  associated  with  these  state  transitions  are  found  as  follows: 

Pi,i  =<nin(i,m£f 

Pi,i+1  "  (n-l-min(i,m))“Y  (2.4.4) 

Pi,i+1  =  e 

Thus,  the  entire  paging  behavior  of  the  random  program  is  described  by 
the  state  transition  diagram  of  Figure  2-6.  It  should  be  noted  that  the 
behavior  of  this  random  program  is  now  formulated  as  a  d iscrete-time 
Markov  process.  It  is  seen  that  if  the  program  fits  into  the  available 
memory  space  (nSm  +  l)  the  number  of  page  faults  to  be  generated  is  at 
most  n,  but  if  the  program  does  not  fit  into  the  memory  space  (n>m  +  l) 
an  infinite  number  of  page  faults  can  occur  at  least  theoretically.  A 
comparison  of  Figure  2-6  (a)  and  Figure  2-6  (b)  reveals  that  the  Figure 
2-6  (b)  with  n  =  m  +  1  reduces  to  the  Figure  2-6  (a).  This  means  that  it 
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suffices  to  analyze  only  the  latter  case,  i.e.,  n^m  +  1,  because  the 
result  of  the  former  case  (nim+1)  can  be  obtained  simply  by  setting 
n=m  +  l  in  the  analysis  of  the  latter  case  (n^m  +  1).  Therefore,  we 
assume  hereafter  that  n*m  +  l,i.e.,  the  program  does  not  (necessarily) 

fit  into  the  primary  memory  space. 

Let  p(np)  be  the  probability  that  np(2inp)  page  faults  result 
during  the  execution  of  this  random  program.  This  probability  can  be 
obtained  by  multiplying  all  the  probabilities  given  on  the  branches 
leading  to  the  state  Snp  of  Figure  2-6  (b),  with  an  extra  consideration 
on  looping  at  each  intermediate  state  [H2].  If  2Snpin+l,  we  obtain 
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If 


n  -  m  +  1 . 
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is  similarly  obtained  as 
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Now  let  w  be  the  ratio  of  the  reference  probability  of  the  ending  page 
and  that  of  an  ordinarv  page.  i.e.. 
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(2.4.6) 


We  call  w  the  ending-page  weight.  If  the  ending-page  weight  is  one, 
that  is,  if  the  reference  probabities  of  all  the  n  pages  of  the  random 
program  are  equal,  Eq.  (2.4.5)  reduces  to 
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(2.4.7a) 


(2.4.7b) 


From  these  results,  we  can  readily  obtain  the  average  number,  E(n  )  of 

p  ’ 

page  faults  to  be  caused  by  this  program.  To  illustrate  the  shape  of 

P(np),  some  examples  are  given  in  Figure  2-7.  Figure  2-7  (a)  shows  the 

effect  of  w  on  P(np),  for  a  random  program  which  fits  into  the  memory 

space,  and  therefore  the  value  of  np  ranges  from  2  to  n.  On  the  other 

hand.  Figure  2-7  (b)  shows  an  example  of  a  random  program  which  does 

not  fit  into  the  memory  space  and  therefore  the  value  of  n  ranges  from 

P 

2  to  infinity. 

Next,  we  proceed  to  derive  the  average  program  execution  time,  E(T  ) 

’  e 

Noting  that  the  program  execution  time  is  the  sum  of  the  holding  times  of 
all  the  intermediate  states  leading  to  the  end  state,  we  first  derive  the 
expected  holding  time,  E(ti),  of  a  state  The  holding  time  in  Si 

follows  a  geometric  distribution  with  a  parameter  1-p  ,  and  therefore. 


noting  Eq.  (2.4.4), 
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(tf)  = 
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The  general  shape  of  E(ti)  is  shown  in  Figure  2-8.  It  is  seen  that  as 
the  number  of  pages  in  primary  memory  increases  the  expected  holding 
time  (or  the  mean  headway  between  page  faults)  increases.  From 

Eq.  (2.4.8),  we  obtain  the  average  program  execution  time  with  exactly 

n  page  faults,  E(T  (n  )). 
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(2.4.9) 


The  general  shapes  of  the  curves  which  show  the  growth  of  the  number  of 
pages  in  primary  memory  and  that  of  the  number  of  pages  faults  are  given 
in  Figure  2-9.  Averaging  E(T>  ))  obtained  above,  with  respect  to  n  , 
we  can  obtain  the  desired  average  execution  time,  E(Tg),  of  the  program, 
as  follows. 

CD 

E  (Te)  =  E  P(n  )  E(T  (n  )) 
n  =2  H  e  P 
P 


(2.4.10) 
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However,  E(Te)  may  be  obtained  by  a  more  direct  method.  Let  S^, 

Sf,  and  Sg  denote  respectively  a  start ing  state  where  the  random  program 
is  about  to  start  its  execution  with  no  page  in  primary  memory,  a  running 
state  where  the  program  is  running  without  ever  referencing  the  ending 
page,  and  an  ending  state  where  the  program  has  rel  renced  the  ending 
page.  Noting  that  the  reference  probability  of  the  ending  page  is 
assumed  to  be  e  (a  constant)  in  any  stage  of  the  program  execution,  we 
find  that  these  three  states  form  a  simple  Markov  chain  shown  in  Figure 
2-10  Therefore,  the  average  execution  time,  E(Te),  of  this  program  is 
given  by  the  first  passage  time  to  state  S^.  Thus, 

00 

E  (T  )  =  (  1+  £  i* (l-e)i-*e  )tft 
i=0  W 


=  (  +  2 
'  w 


)t, 


(2.4.11) 


It  is  seen  that  the  reference  probability  of  the  ending  page  completely 
determines  the  average  execution  time  of  the  program  and  that  the  average 
execution  time  is  roughly  proportional  to  the  ratio  of  the  program  size 
and  the  ending-page  weight. 

Finally,  we  can  obtain  the  mhbpf  of  this  random  program  by  combining 
the  results  of  Eq.  (2.4.5)  and  Eq.  (2.4.11),  as  follows: 
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Figure  2-10  Simpler  Derivation  of  the  Average 
Execution  Time  of  a  Random  Program 
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In  particular,  if  the  reference  probabilities  of  the  ending  page  and  an 
ordinary  page  are  equal,  i.e.,  w=l,  then  Eq.  (2.4.12)  reduces  to 


mhbpf 

2in±li  n£m+l  (2.4.13  a) 

n+2  0 

2(n-lJln+lJ -  n £  m+2  (2.4.13b) 

2n  - 2mn+(m+2 )  (m- 1 ) 


It  can  be  easily  verified  that  as  m/n  approaches  zero,  i.e.,  as  the 
available  memory  size  becomes  much  smaller  than  the  program  size,  the 
mhbpf  of  the  program  approaches  t^,  the  unit  time  of  this  discrete-time 
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Markov  process  model. 


2.4.3.  Numerical  Examples  of  Random  Program  Behavior 

In  the  previous  section,  the  average  execution  time,  the  average 
number  of  page  faults,  and  the  mhbpf  of  a  random  program  were  explicitly 
obtained  as  a  function  of  the  primary  memory  size  (m),  the  program  size 
(n),  the  ending-page  weight  (w),  and  the  unit  time  of  the  discrete-time 
Markov  process  model  (tQ).  In  this  section,  the  effects  of  these  para¬ 
meters  on  the  program's  performance  will  be  numerically  evaluated. 

As  is  clear  from  Figure  2-7  (a)  and  Eq .  (2.4.11),  the  ending-page 
weight  w  has  a  major  effect  on  both  the  average  number  of  page  faults 
and  the  program's  average  execution  time.  Therefore,  the  effect  of  w 
upon  the  program's  mhbpf,  which  is  the  ratio  of  these  two  averages,  was 
numerically  evaluated  in  Figure  2-11.  It  is  seen  that  the  smaller  the 
value  of  w  becomes  the  longer  mhbpf  becomes.  The  other  point  to  be 
noted  is  that  mhbpf  is  almost  independent  of  the  program  size  so  long 
as  the  program  fits  into  the  available  primary  memory  space. 

Next,  we  move  on  to  the  central  theme  of  this  section,  that  is,  a 
numerical  evaluation  of  the  effect  of  primary  memory  size  m,  using 
Eq.  (2.4.12).  This  result  is  summarized  in  Figure  2-12.  It  is  observed 
that  the  curve  representing  the  length  of  mhbpf  branches  into  several 
downward  curves  each  corresponding  to  a  particular  primary  memory  size 
as  the  program  size  increases.  In  particualr,  it  should  be  noted  that 
as  the  program  size  exceeds  the  memory  size  the  downward  branch  curve 
representing  mhbpf  abruptly  decreases  and  from  then  on  it  c  'ntinues  to 


Figure  2-12  Effect  of  Primary  Memory  Size  upon  mhbpf 
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decrease  only  gradually,  approaching  the  unit  time  tQ  of  the  model 
asymptotically.  This  kind  of  abrupt  decrease  of  mhbpf  was  called  thrash- 
iHS  by  Denning  [D2 ] .  However,  this  abruptness  shown  in  Figure  2-12  is 
somewhat  exaggerated  by  the  uniform  page  reference  probabilities  assumed 
in  the  random  behavior  model;  if  this  model  could  include  some  less- 
frequently-used  pages  also  this  abrupt  decrease  of  mhbpf  would  be  moder¬ 
ated.  The  other  way  to  interpret  this  figure  is  to  fix  the  program  size 
instead  of  the  primary  memory  size.  Then,  mhbpf 's  of  a  given  random 
program  operating  in  the  primary  memory  of  various  sizes  can  be  obtained. 
For  example,  the  mhbpf  of  a  random  program,  with  n  =  100  pages,  w  =  0.3, 
ant*  V4  ra*H*secont*j  operating  in  the  primary  memory  of  80  page-frames 
is  found  to  be  about  twice  as  long  as  that  of  the  same  program  operating 
in  the  primary  memory  of  40  page-frames.  The  ratio  of  mhbpf* s  correspond¬ 
ing  to  two  arbitrarily  chosen  primary  memory  sizes,  however,  depends  on 
the  chosen  values  of  m  and  n. 

Finall’y,  the  effect  of  primary  size  m  upon  the  program's  average 
total  execution  time  was  numerically  evaluated,  assuming  that  the  average 
total  execution  time  of  a  random  program  is  given  by  the  sum  of  the 
program's  average  execution  time  and  the  average  total  time  to  be  spent 
for  handling  page  faults.  In  other  words,  denoting  the  expected  paging 
overhead  time  necessary  for  handling  a  page  fault  ,by  t  , 

E (total  execution  time)  =  E(T  )  +  E(n  )*t  (2.4.14) 

e  p  p 

The  result  is  shown  in  Figure  2-13.  It  is  seen  that  the  average  total 
execution  time  of  each  random  program  gradually  increases  as  the  available 
primary  memory  size  becomes  smaller  than  the  program  size.  This  increase 


re 
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in  total  execution  time  is  due  entirely  to  the  increase  in  the  number  of 
page  faults  encountered  during  the  program  execution.  Brawn  and 
Gustavson  [B4]  measured  tne  paging  behavior  of  several  actual  programs 
on  the  IBM  M44/44X  system  and  plotted  the  curves  corresponding  to  those 
shown  in  Figure  2-13.  Their  measurement  result  of  actual  computer 
programs  is  very  similar  to  that  of  Figure  2-13,  except  in  the  following 
two  respects.  First,  the  increase  of  program  execution  time  was  observed 
by  them  when  the  available  primary  memory  size  became  smaller  than  a 
certain  fraction  (typically,  60  percent  rather  than  100  percent  seen  in 
Figure  2-13)  of  the  program  size.  This  probably  means  that  at  most  60 
percent  of  the  entire  program  pages  were  referenced  at  one  time  because 
of  locality  of  information  references.  Second,  the  slope  of  their 
curves  representing  increasing  program  execution  time  is  steeper  than 
that  of  those  shown  in  Figure  2-13.  This  probably  means  that  the 
(expected)  paging  overhead  time  of  their  system  is  larger  than  the  values 
(4  and  8  msec.)  that  we  assumed  in  Figure  2-13. 
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2.5.  Macroscopic  Paging  Performance  Model  for  Multiprogramming 

In  the  previous  sections,  we  assumed  a  partitioned  primary  memory 
with  a  local  page  replacement  algorithm,  and  therefore  we  did  not  consid¬ 
er  sharing  of  aegments  among  eligible  user  processes.  However,  on 
actual  computer  systems  like  the  one  described  in  Section  1.2,  any  non¬ 
resident  program  as  well  as  any  resident  supervisor  program  is  sharable 
among  eligible  user  processes.  Therefore,  we  will  now  proceed  to  consid¬ 
er  the  effect  of  sharing  of  resident  and  non-resident  programs  upon  the 
paging  behavior  of  these  user  processes.  First,  we  will  develop  a 
simple  model  of  sharing  in  estimating  the  number  of  page-frames 
effectively  available  to  each  user  process  under  multiprogramming.  Then, 
in  deriving  the  mhbpf  of  these  user  processes,  we  will  use  a  linear  model 
of  paging  performance  discussed  by  Saltzer  [S3]  as  a  babis.  By  combining 
the  results  of  these  two  models,  it  will  be  possible  to  express  the  mtbpf 
of  an  eligible  user  process  as  a  function  of  the  size  (mrJmn>and  M)  of 
primary  memory,  the  degree  (q)  of  multiprogramming,  and  the  degree  of 

sharing. 


2.5.1.  Model  of  Sharing  among  Eligible  User  Processes 

As  we  have  seen  in  Section  2.5,  the  number  of  primary  memory  page- 
frames  available  to  a  user  process  plays  a  major  role  in  determining 
the  paging  behavior  of  programs.  In  this  section,  we  will  develop  a 
simple  model  of  sharing  and  then  use  it  to  estimate  the  number  of  primary 
memory  page-frames,  including  those  being  shared,  available  to  each  user 
process  under  multiprogramming. 
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We  assume  that  the  paging  behavior  of  all  user  processes  under  multi¬ 
programming,  i.e.,  all  eligible  user  processes,  is  probabilistically 
identical;  we  will  consider  only  one  class  of  user  processes  whose 
paging  behavior  is  probabilistically  specified.  We  assume  that  each 
eligible  user  process  is  scheduled  for  service  by  a  processor  under  the 
FCFS  discipline  and  therefore  the  processors  are  equally  accessible  to 
the  processes.  Thus,  it  is  fairly  reasonable  to  expect  that  each  of  q 
existing  eligible  user  processes  uses  approximately  the  same  amount  of 
primary  memory  under  the  management  of  a  demand-paging  global  page  re¬ 
placement  algorithm.  It  is  assumed  that  the  entire  primary  memory  space 
is  fully  utilized  by  these  q  eligible  user  processes  under  the  memory 
allocation  depicted  in  Figure  2-1. 

Now  we  are  ready  to  present  a  model  of  (segment)  sharing  among  the 
eligible  user  processes. 

Model  of  Sharing 

2 

(1)  Sharing  of  resident  programs:  a x 10  percent  of  m^  page-frames 
occupied  by  the  resident  programs  is  actually  used  by  each  eligible 
user  process,  on  the  average. 

2 

(2)  Sharing  of  non-resident  programs:  b  x  10  percent  of  the  memory 

space  (mg  page-frames  in  size)  of  each  eligible  user  process,  taken 

out  of  m  page-frames  reserved  for  non-resident  programs,  overlaps 
n 

with  the  memory  space  (m^  page- frames  in  size)  of  another  eligible 
user  process,  on  the  average.  Any  such  overlapping  of  memory  spaces 
is  independent  of  any  other  overlapping  of  memory  spaces. 
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We  will  first  investigate  the  properties  of  sharing  of  non-resident 
programs.  For  this  purpose,  let  P ^ ( i—1, 2, . . . ,q)  be  the  i-th  eligible 
user  process,  m0(pi> p2» *  *  * ,Pi)  be  the  avera8e  slze  of  overlapping  of  i 
(non-resident  program)  memory  spaces  used  by  processes  through  P^, 
and  m0(Pi+P2+* • •+*>i)  be  the  average  size  of  (non-resident  program)  joint 
memory  space  of  processes  P^  through  P^.  Then,  under  the  assumptions  of 
our  model  of  sharing,  we  have 


mo(pi’  V  =  b*mo 

m0(pi1  P2’  =  b  *m0 

m0(Pl’  P2’  ’  "  »  Pi)  =  1,1  1,m0 


(2.5.1) 


Therefore,  as  illustrated  in  Figure  2-14,  we  get 
m0(pi+  P2)  =  (2-b)m0 
m0(pi+  P2+  P3)  =  (3-3b+b2)m0 

In  general  [FI],  we  obtain 

"«<P1+  P2+  ' ' '  +  V 

■  ‘  E"0(Pi'  V  +  EVpi-  P»>  Pn>  ' 

all  singles  all  pairs  all  triples 

...  +  (•l)q'4m0(P1,  Pr  ...  ,  Pq) 
all  q-tuples 

=  [  (j) -1  +  (2)*(-b)  +  (^) ' (•b)2+  ...  +  (q) * ("b)q_1 1 *m0 

=  E  (?),(-b)i”1,m_ 
i=l  1 


(2.5.2) 
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Thus,  we  have  obtained  the  average  size  of  total  memory  space  which  is 
required  by  the  non-resident  programs  invoked  by  a  set  of  q  eligible  user 
processes . 

This  average  size  of  primary  memory  space  associated  with  the  q 
eligible  user  processes  can  be  easily  proved  to  have  the  following  prop¬ 
erties; 


Property  1 

lim  tnQ<  P  -I-  P2+...+  P  )  =  qmQ 
b-  0  q 

1  Im  ra  (  Pj+  P2+. ..+  P  )  = 

b—  1  q 

Property  2 

Am0(q)  *  W  V*'*  +  Pq+l)"m0(Pl+  V-^V 

=  (l-b)q*m0 


(2.5.3a) 

(2.5.3b) 


(2.5.4) 


Property  3 

lim  m  (  P.+  P,+. . .+  P  ) 
0  12  q 


(2.5.5) 


The  dependency  ot  mQ(P1+  P2+. ..+  P  )  upon  the  degree(b)  of  sharing 
is  seen  in  Property  1;  Eq.  (2.5.3a)  means  that  if  there  is  not  any 
sharing  at  all  the  total  memory  space  required  by  the  q  user  processes 
under  multiprogramming  is  simply  q  times  as  large  as  the  memory  space 
required  by  each  process,  and  Eq.  (2.5.3b)  means  that  if  non-resident 
programs  ar*  completely  shared  by  the  q  user  processes  under  multi¬ 
programming  (consider  a  special-purpose  computer  system)  the  total 
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memory  space  required  is  equal  to  the  space  of  only  one  process. 

Property  2  indicates  that  mg(P^+  P^.  ..+  P  )  is  a  non-decreasing 
function  of  the  degree  of  multiprogramming  and  that  if  one  more  user 
process  must  be  made  eligible  without  changing  the  effectively  available 
memory  space  (m^)  of  each  process  then  the  additional  primary  memory 
space  to  be  required  is  given  by  a  geometric  probability  distribution. 
Property  3  means  that  an  infinite  number  of  eligible  user  processes  can 
coexist  using  a  finite  primary  memory  space  because  we  assumed 
"independent"  sharing  (overlapping  of  memory  spaces)  of  the  processes' 
non-resident  program. 

Finally,  we  will  evaluate  the  average  number  of  page-frames  which 
are  available  to  each  of  these  q  user  processes  under  multiprogramming, 
assuming  that  a  given  primary  memoiy  of  M  page-frames  is  divided  into 
two  areas,  i.e.,  the  aiea  with  m^  page-frames  for  vcsident  programs  and 
the  area  with  m^^M-m^)  page-frames  for  non-resident  programs  (see 
Figure  2-1). 

From  the  first  assumption  concerning  our  model  of  sharing,  the 
average  number  of  page-frames  storing  the  pages  of  resident  programs 
used  by  an  eligible  process  is  am^.  On  the  other  hand,  the  average 
number  (mQ)  of  page-frames  available  to  the  eligible  process  for  storing 
non-resident  programs  is  found  from  Eq.  (2.5.2)  to  be 


m 


0 


m  • 

n 


b 


l(lb)q 


(2.5.6) 


Therefore,  the  average  total  number  (m)  of  page-frames  available  to 
each  of  these  q  user  processes  under  multiprogramming  (using  the  primary 
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memory  depicted  by  Figure  2-1)  is  obtained  as  follows, 
b  m 

m  =  am  +  “  (2.5.7) 

r  l-(l-b)q 

The  overall  percentile  saving  of  primary  memory  due  to  sharing  of 
segments  is  given  by  1  -  (M/qm). 


2.5.2.  Evaluation  of  mtbpf  in  Multiprogramming  Environment 

In  this  section,  we  will  present  a  simple  macroscopic  model  of 
paging  behavior  which  relates  the  average  number  (m)  of  page-frames 
effectively  available  to  a  user  process  under  multiprogramming  to  the 
mtbpf  of  the  user  process. 

Before  getting  into  the  development  of  this  model,  we  must  briefly 
consider  tbpf  (time  between  page  faults)  and  mtbpf  (mean  time  between 
page  faults).  It  was  previously  stated  that  each  burst  of  continuous 
program  execution,  i.e.,  tbpf,  consists  of  at  least  user  program  exe¬ 
cution  anfl  paging  overhead  execution.  To  be  more  realistic,  tbpf 
includes  a  proportion  of  time  (t^)  spent  as  miscellaneous  overhead  of 
the  supervisor1,  as  well  as  a  proportion  of  time  (tu)  spent  as  a  user's 
useful  work  and  a  proportion  of  time  (tp)  spent  as  paging  overhead  of 
the  supervisor  That  is  to  say, 

tbpf  =  tu+  tp4  tm  (2.5.8) 

^his  complication  is  now  necessary  as  a  step  to  evaluate  the  amount  of 
user-oriented  computation  in  Chapter  3. 
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The  first  exponent,  t|i,  of  the  tbpf  includes  the  execution  of  both  user 
programs  and  non-overhead-type  (user-oriented)  supervisor  programs  (e.g., 
input/output  control  programs),  and  Is  generally  called  a  headway  between 
HS-Uaolts  (hbpf).  The  second  component,  tp,  of  the  tbpf  includes  eeveral 
supervisory  functions  required  to  handle  a  page  ft  ,lt,  as  roughly 
explained  in  Section  2.1,  and  is  generally  called  a  pasinu  overhead 
The  third  component,  t^,  of  the  tbpf  includes  several  supervisory 
functions  required  to  handle  miscellaneous  faults,  such  as  segment 
faults,  protection  faults,  various  non-paging  interrupts,  etc.,  which 
occur  during  the  duration  of  the  tbpf,  and  1.  generally  called  a 
miscellaneous  overhead  time.  Denoting  average  values  by  barred  symbols, 
the  mean  time  between  page  faults  (mtbpf)  is  expressed  as 


mtbpf  =  t  +  t  +  t  . 

u  p  m* 


(2.5.9) 


where  tu,  tp,  and  tm  respectively  represents  the  mean  headway  betw^^n 
^ge_faults_(mhbpf),  the  mean  paging  overhead  time,  and  the  mean 
miscellaneous  overhead  time. 


We  assume  that  the  mhbpf  (or  I  )  of  a  user  process  under  multi¬ 
programming  can  be  determined  by  the  average  number  (m)  of  page-frames 
that  are  effectively  available  to  the  user  process  and,  in  particular, 
that  the  mhbpf  is  generally  expressed  ao 


t  =  t  (m) 
u  u 


=  V 


c^m 


2 

c2m  + 


3 

5^  *f «  »  # 


(2.5.10) 


where  <^0=0, 1.2,  . . .)  are  constant  coefficients.  On  the  other  hand,  the 
miscellaneous  faults  (segment  faults,  protection  faults,  non-paging 
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interrupts,  etc.)  are  likely  to  occur  uniformly  at  any  instant  during 
the  execution  of  a  user's  useful  work.  Therefore,  it  is  reasonable  to 
assume  that  the  miscellaneous  overhead  time  is  linearly  proportional  to 
the  mhbpf.  That  is  to  say, 

t  =  6  t  (2.5.11) 

m  u 

where  6  (a  constant)  is  called  the  miscellaneous  overhead  coefficient. 

It  has  been  experimentally  observed  on  the  Multics  system  that  the 

paging  overhead  time  is  roughly  independent  of  the  primary  memory  size, 

although  the  execution  times  of  certain  supervisory  functions  included 

in  the  paging  overhead  (e.g.,  the  execution  of  a  page  replacement 

algorithm)  may  depend  on  the  primary  memory  size.  Therefore,  we  assume 

that  the  paging  overhead  time,  t  ,  does  not  depend  on  the  primary  memory 

P 

size.  Thus,  the  mtbpf  of  an  eligible  user  process  which  effectively 
uses  m  page-frames  of  the  primary  memory  is  given  by 


m 


tbpf  =  mtbpf(m) 


=  (  t  +  t  )  +  t 
'  u  m  p 


=  (  1  +6  )•  tu(m)  +  tp 


(2.5.12) 


On  the  other  hand  ,  Saltzer  [S3]  describes  a  linear  model  of 
paging  performance  ,  which  was  developed  from  performance  measure¬ 
ments  of  the  Multics  system  .  He  observed  that  the  mhbtf  of  an  eligi¬ 
ble  user  process  is  approximately  linearly  proportional  to  the  size  of 
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primary  memory  ,  for  a  wide  range  of  the  primary  memory  size.  This 
observation  justifies  (at  least  partially)  the  approximation  of  7 with 
only  the  second  term  of  Eq.  (2.5.10).  Thus, 


tu(m)  =  Vm  (2.5.13) 

We  call  this  simple  model  of  mhbpf  a  linear  paging  model  and  we  expect 
it  to  serve  as  a  macroscopic  model  of  paging  behavior  of  user  processes. 

Therefore,  combining  this  linear  paging  model  and  the  model  of 
sharing  developed  in  the  previous  section,  we  obtain  the  following 
expression  for  the  mtbpf  of  an  eligible  user  process. 


mtbpf  =  (  1 +6  )•  tu(m)  +  tp 

=  (  1  +6  )  •  c-  m  +  7 
1  P 

b  m 

=  (  1+6  )•  c  •  (am  + - S-r)  +  tn  (2.5.14) 

1  r  l-(l-b)q  p 


The  behavior  of  the  above  three  components  (t  ,  t  ,  and  t  1  of  the 

u*  m’  p'  L  Ie 

mtbpf  is  explained  as  a  function  of  the  primary  memory  spate  (m  )  avail- 

n 

able  to  non-resident  programs  in  Figure  2  15.  The  gross  behavior  of  the 
mtbpf  as  a  function  of  mn  and  the  degree  (q)  of  multiprogramming  is 


This  observation  was  made  about  the  averaged  behavior  of  an  aggregate 
of  programs  invoked  by  an  eligible  user  process.  It  does  not  necessarily 
represent  the  behavior  of  a  particular  program. 
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t  ;  mean  headway  between  page  faults  made  in  resident  programs 


t  ;  mean  miscellaneous  overhead  time  spent  in  resident  programs 
mr 

t  :  mean  headway  between  page  faults  made  in  non-resident  programs 
un 

"t  ;  mean  miscellaneous  overhead  time  spent  in  non-resident  programs 
mn 


t 

u 


t  +  t 
ur  un 


t  =  t  +  t 
m  mr  mn 


mtbpf  =  t  +  t  +  t 
u  m  p 


Figure  2-15  Behavior  of  Various  Components  of  mtbpf 


114 


similarly  shown  in  Figure  2-16.  It  should  be  noted  that  the  mtbpf  of  an 
eligible  user  process  under  multiprogramming  of  a  given  degree  increases 
linearly  with  the  available  primary  memory  size.  Finally,  to  evaluate 
the  effect  of  sharing  upon  the  mtbpf  of  eligible  user  processes  under 
multiprogramming,  three  cases  concerning  sharing  (no  sharing,  ten  percent 
sharing,  twenty  percent  sharing  of  non-resident  programs)  were  considered 
This  result  is  given  in  Table  2-2.  It  is  observed  that  the  effect  of 
sharing  upon  mtbpf  becomes  more  evident  as  the  number  of  user  processes 
that  must  compete  with  each  other  within  a  fixed  amount  of  primary  memory 
increases.  Later  in  Chapter  5,  the  gain  in  overall  system  performance 
(i.e.,  the  system  throughput  and  the  system  response  time)  due  to  sharing 
of  programs  will  be  numerically  evaluated. 


Figure  2-16  Effect  of  Primary  Memory  Size  and  Degree  of  Sharing 
upon  mtbpf 
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Table  2-2  Effect  of  Multiprogramming  and  Sharing  upon  mtbpf 


Degree  of 
Multiprogramming 

9 

bxO 

Degree  of  Sharing 
of  Non-Resident  Programs 

baO.l  b  =  0.2 

1 

99.5 

msec  99.5  msec 

99.5  msec 

2 

54.5 

56.9 

59.5 

3 

39.5 

42.7 

46.4 

4 

32.0 

35.7 

40.0 

5 

j  27.5 

31.5 

36.3 

6 

24.5 

28.7 

33.9 

7 

22.3 

26.8 

32.3 

8 

• 

20.7 

• 

25.3 

• 

31.1 

• 

• 

GO 

• 

• 

9.5 

• 

• 

18.5 

• 

• 

27.5 

M=384  page-frames,  m^  =  84  page-frames,  m^  =  300  page-frames, 

a=0,l,  t  =  7  msec . 

1  P 
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2.6.  Summary 

Because  program  behavior  is  so  complex  that  we  have  developed 
several  models  each  of  which  studies  only  a  particular  aspect  o_  program 
behavior,  rather  than  attempting  to  develop  a  single  "universal'  program 
behavior  model.  The  first  three  models  respectively  attack  the  PRA 
(page  replacement  algorithm)  studies,  the  page  size  problem,  and  the 
memory  size  problem.  The  analyses  of  these  models  respectively  allows 
us  to  evaluate  numerically  the  possible  differences  in  program  perfor¬ 
mance  (the  mean  time  between  page  faults)  due  to  the  PRA  (page  replace¬ 
ment  algirithm)  in  use,  the  page  size  being  used,  and  the  size  of 
primary  memory  available  to  a  user  process.  The  macroscopic  paging 
performance  model  described  in  Section  2.5  is  based  on  a  model  of 
sharing  and  Saltzer's  linear  paging  model,  and  allows  us  to  evaluate 
the  mtbpf  (mean  time  between  page  faults)  of  user  processes  under  multi¬ 
programming  of  a  given  degree. 


CHAPTER  3 


THROUGHPUT  ANALYSIS 


3.1.  Introduction 

Having  studied  program  behavior  within  a  limited  amount  of  primary 
memory,  we  shall  proceed  to  evaluate  the  performance  of  a  processing 
system  consisting  of  processors,  primary  and  seconday  memories,  and 
user  and  supervisor  programs  (see  Figure  1-4).  The  hardware  is,  in 
general,  assumed  to  possess  multiple  processors  and  multiple  primary 
memory  units,  with  secondary  memory  system  as  part  of  the  virtual 
memory  (see  Fig.  1-2).  The  operating  system  is  assumed  to  allow  multi¬ 
ple  jobs  to  possess  some  of  their  pages  in  primary  memory  using  a  multi¬ 
programming  mechanism. 

Our  ultimece  goal  of  this  thesis  is  to  develop  a  methodology  to 
optimize  such  multi-processer  multi-memory  multiprogrammed  virtual- 
memory  computer  system  in  cost-performance.  As  the  first  step  towards 
this  goal.  Chapter  3  presents  a  method  to  evaluate  the  performance  of 
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a  processing  system  of  a  given  hardware  configuration  (number  of 
Process.-. s,  number  of  primary  memory  units,  speed  of  processors,  speed 
of  memory  system,  size  of  memory  system,  etc.),  a  given  software 
configuration  (size  of  resident  supervisor,  various  overhead  rimes, 
degree  of  multiprogramming,  etc.)  and  given  user  program  characteristics 
(mhbpf,  etc.). 

We  choose  system  throughput,  i.e.,  the  average  numbei  of  jobs  that 
can  be  completed  within  a  unit  time,  as  our  performance  measure  of  the 
processing  system.  In  e  aluating  the  throughput  of  the  processing 
system  of  a  given  configuration,  various  wastages  of  the  system's 
computational  capacity  (various  idle  times  and  system  overhead  times  to 
be  defined)  will  be  considered.  Therefore,  the  system's  effective 
computational  power  can  be  evaluated  accurately.  The  result  of  this 
analysis  then  allows  us  to  optimize  the  degree  of  multiprogramming 
realistically  in  such  a  way  that  throughput  of  the  system  is  maximized. 

It  will  be  seen  in  Chapter  4  that  this  optimization  approach  leads  to 
the  minimization  of  the  average  response  time  experienced  by  each 
interactive  user  of  the  computer  system. 
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3.2.  Preliminary  Considerations 

Before  getting  into  a  mathematical  anaysis  of  th*  system  through¬ 
put  of  a  multi-processor  multi-memory  multi-progra’*™®*!  virtual-memory 
computer,  some  preliminary  considerations  will  b©  given  in  this  section 
in  order  to  put  the  following  analysis  in  proper  perspective. 

We  have  seen  in  Chapter  2  how  the  three  components  (t  ,  t  t  )  of 

u  m  p 

mrbpf  depend  on  the  primary  memory  size.  We  now  consider  how  each  of 
these  can  be  affected  by  a  multi-processor  multi-memory  configuration 
of  the  processing  system.  A  program's  execution  time  is  generally 
prolonged  on  a  multi-processor  computer  system  because  of  multi¬ 
processor  interference  such  as  memory  cycle  interference  and  data-base 
lockout.  The  former  is  the  interference  caused  by  the  occasional 
conflict  of  multiple  processors  for  a  memory  cycle  of  a  particular 
primary  memory  unit  and  the  latter  is  the  interference  caused  by  the 
occasional  conflict  of  multiple  processors  for  the  use  of  a  particular 
shared,  writable  data-base  which  cannot  allow  simultaneous  accesses  of 
multiple  processors.  Because  a  memroy  cycle  of  a  primary  memory  unit 
or  a  shared  writable  data-base  can  be  used  only  serially,  only  one  of 
the  conflicting  processors  can  use  it  immediately  and  others  must  wait 
for  their  turn  to  use  it.  Therefore,  mtbpf  is  prolonged  on  the  multi¬ 
processor  computer  system. 

Table  3-1  summarizes  the  way  in  which  the  dependency  of  the  three 
components  of  mtbpf,  i.e., 

t 

_u 
t 

jn 
t 

P 


(mean  headway  between  page  faults), 
(mean  miscellaneous  overhead  time),  and 
(mean  paging  overhead  time), 
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upon  the  multi-processor  interference  as  well  as  upon  the  primar>  memory 

size  will  be  considered  in  later  mathematical  analyses.  It  is  assumed 

that  all  fault  handlers  (page  fault  handler,  segment  fault  handler, 

protection  fault  handler,  etc.)  of  the  supervisor  are  either  permanently 

resident  or  almost  always  resident  in  primary  memory  by  virtue  of  the 

fact  that  these  most- frequent ly-used  programs  are  likely  to  be  shared  by 

many  user  processes.  On  the  other  hand,  non-resident  programs  such  as 

user  programs  are  assumed  to  be  initially  in  the  secondary  memory  system 

and  to  be  brought  into  primary  memory  under  demand  paging,  as  stated  in 

Chapter  2.  Under  these  assumptions,  we  will  use  the  macroscopic  paging 

performance  model  described  in  Section  2.5;  t  is  Independent  of  primary 

memory  size  but  t  and  t  depend  on  (arc  roughly  linearly  proportional 
u  m 

to)  the  size  of  primary  memory  effectively  available  to  each  user 
process  under  multiprogranming,  as  shown  in  the  first  row  of  Table  3-1. 

It  is  well-known  that  the  execution  of  a  page  fault  handler 
involves  some  shared,  writable  data-bases  (i.e.,  resource  tables  and 
resource  queues  [SI])  which  may  significantly  delay  the  execution  of  the 
page  fault  handler  on  the  mu  It i -processor  computer  system,  while  the 
execution  of  users'  useful  work  or  of  miscellaneous  fault  handlers 
normally  dees  not  involve  such  frequently-used  shared  writable  data-bases. 
Therefore,  only  t  is  assumed  to  be  significantly  affected  by  data-base 
lockout,  as  shown  in  Table  3-1.  On  the  other  hand,  the  execution  of  any 
program  is  subject  to  memory  cycle  interference  on  the  multi-processor 
computer  system.  Therefore,  all  three  components  of  mtbpf  are  assumed 
to  be  affected  by  memory  cycle  interference,  ns  shown  in  Table  3-1.  It 
has  been  actually  found  on  the  Multics  system  (to  be  rtported  in 
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Section  5.2.1)that  a  noticeable  amount  of  processor  time  is  lost  due  to 
the  multi-processor  interference. 

In  order  to  consider  quantitatively  the  effect  of  this  multi¬ 
processor  interference,  let  t  t  and  t  .  denote  the  t  ,  t  ,  and  t 
r  ’  ui  mi  pi  u  m  p 

of  an  i-processor  ( i=l, 2 )  computer  system.  Letting  (1SK^2)  be 
the  slow-down  factor  due  to  memory  cycle  interference  on  a  dual  proces¬ 
sor  system,  we  have  the  following  relationship  between  the  t  of  a 
single  processor  system  and  that  of  a  dual  processor  system  with  the 

same  configuration. 

/  _ 

tu2=Kmtul  (3.2.1) 

where  the  value  of  K  will  be  determined  in  a  later  analysis.  Assuming 

m 

the  same  degree  of  memory  cycle  interference  for  the  handling  of  mis¬ 
cellaneous  faults,  we  have 


t  _  =  K  t 
m2  m  ml 


(3.2.2) 


On  the  other  hand,  the  relation  between  the  paging  overhead  time  of  a 
single  processor  system  and  that  of  a  dual  processor  system  Is  more 
complex  because  oi  the  additional  Interference  caused  by  data-base  lockout. 
We  assume  that  the  execution  time  of  the  page  fault  handler  on  a  dual 
processor  system  is  stretched  by  a  factor  of  (l£Kj£2  )  because  of 
data-base  lockout  and  is  further  prolonged  by  a  factor  of  because 
of  memory  cycle  Interference.  That  is  to  say. 


m 


t  _  =  K  •  K.  •  t  . 
p2  m  l  pi 


(3.2.3) 


Under  these  conditions,  the  linear  relation  between  t  ,  and  t  , 

ml  ul 


of  Eq.  (2.5.11)  is  now  generalized  to 
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t  .  =  fit  . 
mi  ui 


(3-2.4) 


1  =  1,2 

It  should  be  noted  that  the  miscellaneous  overhead  coefficient  6  is 
independent  of  the  number  (i)  of  processors.  Thus,  mtbpf  of  the  single 
and  dual  processor;  systems  can  be  expressed  as: 


«bpf(l  CPU)  =  tul+  tml+  tpl 

-  <>*>'.1+^1 

=  mtbpf, 


(3.2.5) 


mtbpf (2  CPUs).  tu2+  C.J+  tp2 


=  K  t  .+  K  t  ,+  K  K.t  . 
m  ul  m  ml  m  l  pi 

*  V  (  (1+6)Tul+  K/pl) 
S  V  (mCbPf2) 


(3.2.6) 


Finally,  we  are  ready  to  give  an  expression  for  the  percentage  of 
the  system's  computational  capacity  spent  as  users'  useful  work  (to 
be  called  the  percentile  throughput  of  the  system),  taking  into  con¬ 
sideration  all  of  the  above  system  overheads  and  the  possible  processor 
idle  time.  Let  u  be  a  utilization  factor,  of  the  processors  of  the 
processing  system,  which  will  be  analytically  obtained  by  the  through¬ 
put  analysis  of  this  chapter  as  a  function  of  various  system  parameters 
such  as 

(1)  the  npmber  of  processors, 

(2)  the  number  of  primary  memory  units, 

(3)  the  degree  of  multiprogramming, 
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(4)  various  system  overheads, 

(5)  mtbpfi  (1=1,2) 

(6)  mpft  (the  mean  page  fetch  time)  of  the  secondary  memory 
system. 

Then,  the  desired  percentile  throughput  0  of  the  multi-processor 
computer  system  can  be  expressed  as: 


0  =  u 


ui 


mtbpf (i  CPUs) 


(3.2.7) 


Now  let  T  be  an  arbitrary  length  of  time.  Then,  the  total  time  that 
can  be  spent  as  users'  useful  work  by  the  i  processors  during  T  is 
i0T.  Let  T^,  T  be  the  execution  times  of  k  user  jobs 

(excluding  all  the  system  overhead  times)  which  were  completed  during 
T.  Then,  assuming  that  the  computer  system  was  under  a  full  load,  the 
system  throughput  _9_  can  be  obtained  as  follows: 

£  5  lim  -f- 

T-.oo  1 


lim 
X-*  00 


Hr 


iQT 


T  ,+T  .+  •••  +T  . 
el  e2  ek 


=  lim 
T-oo 


i0 


i0 


(ST.  J/k 

j=l  J 


(3.2.8) 


where  Tg  is  the  average  execution  time  required  for  a  job's  useful  work. 
Therefore,  it  has  been  shown  that  the  percentile  throughput  is  linearly 
proportional  to  the  system  throughput. 
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_3. 3.  Single  Processor  System 

In  this  section,  we  will  be  concerned  with  the  performance  of  a 
single  processor  processing  system  under  multiprogramming  of  a  fixed 
degree  (q) .  It  is  assumed  that  the  time  between  page  faults  (tbpfj  of 
each  user  process  follows  a  certain  stationary  probability  distribution 
with  a  (constant)  mean  determined  by  the  size  of  primary  memory  avail¬ 
able  to  the  job.  (In  detailed  reality,  tbpf  or  hbpf  depends  on  the 
size  of  primary  memory  actually  being  used  at  the  moment  rather  than 
the  amount  of  memory  that  can  be  used,  as  seen  in  Chapter  2,  but  this 
initial  microscopic  transient  behavior  of  programs  is  only  macroscopi- 
cally  considered  hereafter.)  It  is  also  assumed  that  the  computer 
system  under  study  is  fully  loaded  with  user  jobs;  there  always  exist 
at  least  q  executable  user  jobs  on  the  computer  system.  This  means 
that  q  user  processes  are  being  multiprog rammed  at  all  times.  Therefore, 
in  modeling  the  behavior  of  the  multiprogramming  mechanism  described 
in  the  section  surrounded  by  a  broken  line  of  Figure  1-1,  we  need  not 
explicitly  consider  the  completions  of  job  executions  or  the  entries 
of  new  eligible  user  processes.  Furthermore,  because  the  processing 
system  has  only  one  processor  we  need  not  consider  any  multi-processor 
interference  in  this  section. 


3.3.1.  Single  Processor  Multiprogramming  Model 

A  model  of  multiprogramming  described  here  involves  the  following 
resources  of  the  computer  system. 
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(1)  a  processor  queue  (Q^)  which  contains  page-ready  processes, 
i.e.,  the  eligible  user  processes  waiting  for  a  service  by  a 
processor  of  the  processing  system 

(2)  a  processor  (CPU)  which  can  service  one  eligible  user  process 
at  a  time 

(3)  a  secondary  memory  system  (SM)  from  which  a  missing-page  is 
brought  into  primary  memory  on  demand  for  the  eligible  user 
process  that  ^ias  requested  the  page. 

As  explained  in  Section  1.2,  if  an  eligible  user  process  is  waiting 
for  a  processor's  service  in  the  processor  queue  Qp,  the  process  is 
said  to  be  in  the  page-ready  state.  If  the  process  is  being  serviced 
by  the  processor,  it  is  said  to  be  in  the  running  state.  If  the 
process  is  waiting  for  a  missing-page,  it  is  said  to  be  in  the  page^. 
wait  state.  The  (mean)  length  of  time  in  the  running  state  exactly 
corresponds  to  the  (mean)  time  between  page  fau  -s  and  the  (mean) 
length  of  time  in  the  page-wait  state  exactly  corresponds  to  the 
(mean)  page  fetch  time.  All  the  eligible  user  processes  simultaneously 
cycle  around  these  three  states  in  our  conceptual  model  of  multi¬ 
programming,  as  shown  in  Figure  3-1. 

We  usually  assume  that  the  jobs  in  Qp  are  scheduled  under  the 
FCFS  discipline  for  the  processor's  service  when  the  processor  is 
available.  A  preemptive  priority  scheduling  discipline  is  also 
considered  briefly.  We  assume  that  a  service  by  the  secondary  memory 
svstem  to  locate  and  transfer  a  missing-page  of  a  process  into  primary 
memory  is  always  immediately  initiated  and  carried  out  in  parallel 
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with  services  by  the  same  secondary  memory  system  for  other  page-wait 
processes,  the  input/output  channel  is  assumed  not  to  limit  the  rate  of 
page  transfers  between  primary  and  secondary  memories. 

Finally,  we  must  specify  the  probabilistic  property  of  both  the 
time  between  page  faults  (tbpf)  and  the  page  fetch  time  (pft).  We 
assume  that  tbpf  independently  follows  an  exponential  distribution  with 
a  mean  of  1/a  which  is  determined  by  the  macroscopic  paging  performance 
model  of  Section  2.5.  That  is  to  say, 


f(t)  =  ae 


where 


-at 


t  ^  0 


1  - 


(3.3.1) 


mtbpfi  =  -=  tul+  tml+  tpl  =  <l+6)tul+  tpl  I 


Similarly,  we  assume  that  pft  independently  follows  an  exponential 
distribution  with  a  mean  of  1/p.  That  is  to  say, 


g(t)  =  pe"pt 


t^O 


(3.3.2) 


where  mpft-l/p  should  be  determined  by  a  separate  analysis  using  a 
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secondary  memory  model  (see  Figure  1-4) 1.  This  assumption  would  be 
reasonable  especially  if  secondary  memory  consists  of  a  combination  of 
frequen^ly-used  high-speed  devices  and  less-frequently-used  low-speed 
devices. 


3,_3,2.  Multiprogramming  with  FCFS  Scheduling 

We  will  proceed  to  evaluate  the  system  throughput  of  the  single 
processor  processing  system  described  in  the  previous  section,  assuming 
that  the  FCFS  discipline  is  used  in  scheduling  page-ready  eligible  pro¬ 
cesses  [C3].  We  assume  that  tbpf  of  each  eligible  user  prrcess  follows 
the  identical  probability  distribution  of  Eq.  (3.3.1). 

e 

Then,  a  state  of  our  model  of  the  processing  system  is  characterized 
by  the  number  (i)  of  page-ready  processes  (waiting  in  Qp),  the  number  (j) 


Coffman's  result  concerning  a  paging  drum  [C5,C6]  or  the  well-known 
result  of  M/G/l  queues  [C8]  may  be  used  in  this  area.  In  particular, 
the  built-in  meters  of  the  Multics  system  show  that  Coffman's  result, 
reproduced  below,  is  accurate  enough  to  predict  the  mpft  of  a  paging 
drum,in  practice.  Letting  T, N,  and  p  respectively  be  the  drum  revolution 
time,  the  number  of  sectors,  and  the  sector  utilization  factor, 

mpft  =  ™.  +  J_ 

1  -  P  N  , 

where  the  first  and  the  second  terms  respectively  represent  the  mean 
access  time,  which  is  subiect  to  a  queuing  delay,  and  the  constant  page 
transfer  time. 
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of  running  processes  (using  the  CPU),  and  the  number  (k)  of  page-wait 
processes  (associated  with  the  SM),  where  i  +  j+k  =  q  and  j^l,  as  shown 
in  Figure  3-1.  Therefore,  each  (i,j,k)  tuple  defines  a  state  of  the 
model,  as  given  in  Table  3-2.  Noting  that  the  holding  time  of  each 
state  is  exponentially  distributed,  we  see  that  the  behavior  of  this 
model  can  be  regarded  as  a  continuous  time  Markov  process,  in  fact,  a 
simple  birth  and  death  process  known  as  the  machine  repairman  model 
(or  the  machine  interference  problem)  [Fl,J2],  defined  over  these  q  +  1 
states  of  Table  3-2.  All  possible  state  transitions  and  the  correspond¬ 
ing  transition  rates  are  shown  in  the  state  transition  diagram  of 
Figure  3-2. 

The  probability  ( i=0, 1, . . . , q)  that  one  finds  the  system  in 

at  a  randomly  chosen  instant  after  the  system  has  been  in  operation  for 
a  long  time,  i.e.,  the  steady-state  probability  of  a  state  S^,  can 
be  obtained  by  solving  the  balance  equation  [FI, H2 ] 

ait,=  (q-i+l)  p  «i_1  IsiiSq  (3.3.3) 

q 

under  the  condition  E  n,  =  1  .  Thus, 

i=0 


If  the  transition  rate  associated  with  an  S.  S  transition  is  a, 

i  j 

then  this  means  that  the  conditional  probability  that  this  transit  on 


will  occur  in  the  next  At,  given  that  the  present  state  is  S^,  is  given 
byaA*-  for  j^i  «nd  suitably  small  At. 


133 


n 


q  « 

E  <oJ/j!) 
j-0 


OSISq 


(3.3.4) 


where  q=a/p.  In  particular,  the  processor  idle  probability  is 
given  by 


qq/q'. 

°D  q  . 
r  (qJ/j!) 
j=o 


(3.3.5) 


Therefore,  the  percentile  throughput  of  this  system  is  obtained 
from  Eq.  (3.2.7)  as 


(1"n0)*(H6)tu'J+  T  j  "  (l'n0)Ctul 


(3.3.6) 


The  corresponding  system  throughput  can  be  readily  obtained  from 
Eq.  (3.2.8). 


3.3.3.  Multiprogramming  with  Preemptive  Priority  Scheduling 

We  vill  briefly  consider  the  system  throughput  of  the  same  single 
processor  computer  system,  assuming  that  a  preemptive  priority  (PP> 
discipline  like  that  of  the  Multics  system  [01 J  is  used  in  scheduling 
page-ready  eligible  user  processes.  When  this  discipline  is  used,  aTi 
the  eligible  user  processes  have  their  own  distinct  priorities  upon 
which  preemption  decisions  are  solely  based.  Suppose  that  the  super- 
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visor  decides  co  allow  a  ready  process  to  become  eligible  at  a  certain 
Instant  when  there  exist  q'  eligible  user  processes  (see  Figure  1-1). 
This  entering  process  is  immediately  awarded  a  priority  q'  +1  (  -q), 
the  lowest  priority  (i.e.,  largest  in  value)  among  existing  q'  + 1 
eligible  user  processes.  Whenever  a  process  becomes  ineligible  by 
completing  the  execution  of  the  requested  user  job,  the  process  loses 
its  priority  and  accordingly  changes  o(  priority  of  other  eligible  user 
processes  take  place.  Suppose  that  the  leaving  process  has  a  priority 
q".  Then,  if  the  priority  of  a  remaining  process  Is  lower  than  that  of 
the  leaving  process  (i.e.,  larger  than  q^'),  then  the  priority  is 
heightened  by  one  (i.e.,  decreased  .  y  one  in  value);  otherwise,  the 
priority  is  left  unchanged.  Thus,  this  algorithm  gives  a  higher 
priority  trf.  an  older  eligible  user  process,  maintaining  a  set  of 
distinct  priorities  for  the  existing  eligible  user  processes.  The  page- 
ready  processes  in  are  always  ordered  according  to  their  distinct 
priorities  and  the  process  with  the  highest  priority  is  scheduled  for 
the  processor's  service  when  the  processor  is  aviilable.  What  is  more, 
if  the  priority  of  this  process  is  higher  than  that  of  the  running 
process,  this  process  is  allowed  to  preempt  the  processor  by  suspending 
the  progress  of  the  running  process  as  soon  as  possible.  Therefore,  an 
eligible  process  with  a  higher  priority  has  a  greater  chance  to  use  both 
the  processor  and  the  primary  memory.  Thus,  an  older  eligible  user 
process  tends  to  own  more  pages  in  primary  memory  than  newer  eligible 
user  processes,  under  a  global  page  replacement  algorithm.  We  assume 
again  that  the  computer  system  is  under  a  full  load. 
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Instead  of  using  the  tbpf  distribution  of  Eq.  (3.3.1),  we  assume 
the  following  set  of  equations  for  the  tbpf  distributions  of  user 
processes  under  the  PP  discipline. 

-o^t 

fj(t)  ■  a^e  for  the  highest  priority  process  > 

-a2t 

f 2 ( t )  ■»  a^e  for  the  priority  2  process 


-a  t 

fq(t)  ■  a^e  q  for  the  lowest  priority  process 
where  a.sa.s  ...  £q  and  t  *  0. 

1  1  q 


(3.3.7) 


The  fact  that  an  older  process,  l.e.,  a  higher  priority  process,  owns  a 
greater  number  of  get  in  primary  memory  is  reflected  by  a  longer  mtbpf, 
1/0^,  of  a  higher  priority  process.  As  for  the  pft  distribution,  we 
st 1 1 1  assume  Eq.  (3.3.7). 

What  characterizes  a  state  of  this  modified  model  is  not  just  the 
number  of  processes  in  each  of  those  three  states  of  eligible  user 
processes  but  the  combination  of  priorities  of  processes  in  each  of 
those  three  states.  Then,  it  is  again  possible  to  model  the  behavior 
of  this  model  by  a  continuous  time  Markov  process.  Let  an  (i,j,k)  tuple 
represent  a  state  of  this  model,  where  1  is  a  (particular)  combination 
of  priorities  cf  the  page-ready  processes,  j  is  a  priority  of  the 
runnino  -focess,  and  k  is  a  combination  of  priorities  of  the  page-wait 
processes.  Then,  it  can  be  easily  proved  that  the  total  number  (N  )  of 
these  (Markovian)  states  oe  this  model  is  given  by 

N  =  E  (q)  =  2q 
8  k=0  K 


(3.3.8) 
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Unfortunately,  a  general  expression  for  like  Eq.  (3.3.4)  of  the 
FCFS  scheduling  model,  is  too  complicated  to  write  out  (although  jt^  can 
be  easily  evaluated  numerically)  and  therefore  we  will  simply  give  an 
illustrative  example. 

Example  Case  of  q  =  2 

Because  the  supervisor  allows  only  two  eligible  processes  in  primary 
memory  there  exist  only  priority  1  and  2  processes.  Defining  four 
possible  states  as  shown  below,  we  obtain  a  state  transition  diagram 
shown  also  below. 


Qp  CPU  SM 

so 

(1,2) 

S1 

(1)  (2) 

S2 

(2)  (1) 

S3 

(2)  (1) 

Letting  n  =  (jtg,  n^,  n^,  n^)  be  the  steady-state  probabilities  of 
states  (Sq,  S^,  S^,  S^),  these  steady-state  probabilities  can  be 
obtained  by  solving  the  balance  equation 

rr  •  A  =  0 
3 

under  the  condition  I  n  =  1,  where  A  is  the  transition  rate  matrix  [H2] 

i=0 

and  takes  the  following  form  for  this  example. 
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The  processor  idle  probability  is  given  by  y  Therefore,  the 
percentile  throughput  can  be  obtained  by  inserting  nQ  into  Eq.  (3.3.6) 
of  the  previous  section.  The  priority  1  (higher  priority)  process  uses 
the  processor  nith  a  probe-  llity  y  +  y  and  the  priority  2  (lower 
priority)  process  with  a  probability  y  If  a^=0!2=a,  then  we  see 
ithat  the  rtQ  obtained  from  the  above  equation  agrees  with  the  given 
by  Eq.  (3.3.5)  of  the  previous  model.  This  corresponds  to  the  fact 
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that  a  preemption  scheme  does  not  improve  the  processor  utilization  if 
=a2  =  ' ' • =  a q»  un£*er  the  assumption  of  exponential  distributions. 
Finally,  a  numerical  example  of  processor  time  breakdown  is  shown  in 
Table  3-3,  assuming  that  O^aCi^a.  It  is  clearly  seen  that  the 
priority  1  process  uses  the  processor  much  more  than- the  priority  2 
process  because  of  the  preemptive  priority  scheduling  discipline. 


Table  3-3  Processor  Tine  Breakdown  for 
the  PP  Scheduling  Model 


Processor  Time  Breakdown 

0  -  □/( 

1.0 

1*5 

Priority  1  Process 

Q.bb? 

0.500 

0.400 

Priority  2  Process 

0.25* 

0.300 

0.290 

Idle  Time 

G.07T 

0,200 

0.310 
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3.4.  Dual  Processor  System 

The  model  of  a  single  processor  processing  system  under  multi¬ 
programming  of  Section  3.3  is  now  generalized  to  a  dual  processor  multi 
memory  processing  system  under  multiprogramming.  To  specify  the  gross 
configuration  of  the  system,  we  use  a  notation  (q;  m,  n)  where 

(1)  q  is  the  degree  of  multiprogramming, 

(2)  m  is  the  number  of  processors  (m£q), 

(3)  n  is  the  number  of  primary  memory  units,  each  of  which  can 

be  accessed  through  its  own  memory  controller, 

as  schematically  shown  in  Figure  3-3.  We  will  be  concerned  with  the 
system  throughput  of  the  <q;2,n)  configuration  system  in  Section  3.4. 

Although  the  system  of  this  configuration  is  generally  expected  to 
have  a  much  larger  computational  capacity  than  a  single  processor 
system,  this  system  will  encounter  new  problems  which  may  seriously 
limit  its  computational  power.  These  problems  are  caused  by  multi¬ 
processor  interference  such  as  memory  cycle  interference  and  data-base 
lockout.  These  interference  sources  effectively  reduce  the  speed  of 
the  processing  system.  We  will  therefore  develop  a  model  of  the 
(q;2,  n)  configuration  processing  system  and  evaluate  the  throughput 
of  this  system.  The  effective  loss  of  the  system's  computational 
capacity  due  to  multi-processor  interference,  various  system  overheads, 

aid  the  multiprogramming  (processor)  idle  time  (to  be  defined)  will 
also  be  evaluated. 
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3.4.1  Dual  Processor  Multiprogramming  Model 

A  model  of  multiprogramming  on  the  (q;  2,  n)  configuration  sys te:: 

is  based  on  the  following  assumptions: 

(1)  The  virtual  memory  is  paged.  A  given  page  of  the  primary 
memory  system  is  found  in  a  memory  unit  i  d-i-n;  with  a  proba 
bility  1/n  (uniformness  assumption). 

(2)  The  length  of  in-page  operation  of  a  program  (the  length 
time  during  which  processor  control  remains  within  a  particular 
page  without  making  any  reference  to  other  pages)  follows  an 
exponential  distribution  with  a  mean  of 


(3)  A  page  fault  occurs  with  a  probability  p  (missing-page 
probability)  when  processor  control  is  transferred  to  another  F^gt- 
The  missing-page  probability  is  common  to  all  user  programs. 

(4)  A  missing-page  will  be  fetched  from  the  secondary  memory 
system.  The  length  of  time  required  to  complete  a  page  fetch.,  cr. 
the  page  fetch  time  (pft).  follows  an  exponential  distribution  vitr. 

a  mean  of  1/p. 


(5)  The  processor  queue  (Q^)  is  common  to  two 
over,  the  processing  system  is  assumed  to  have 
all  times  to  ensure  the  existence  of  q  eligible 
multiprogramming  of  degree  q  (full  load  assur.pt 


processor s . 
more  than  c 


processes 
ion'' . 


v.e 


e- 
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(6)  If  both  processors  direct  their  accesses  to  the  same  unit  of 
primary  memory  at  all  tines,  the  execution  time  of  a  program  on 
each  processor  would  be  stretched  by  a  factor  of  V  (l^V-2),  on 
the  average^.  V  is  called  the  memory  (cycle)  interference 
coefficient .  We  say  that  a  memory  unit  has  no  interference  if 
Y=1  and  that  a  memory  unit  has  a  complete  interference  if  Y  =  2. 

First ,  we  will  prove  that  tbpf  aoproximately  follows  an  exponen¬ 

tial  distribution  under  these  assumtions.  Noting  that  hbpf  consists 
of  successive  in-page  operations  (see  Section  2.4.2),  we  see  that 

f^(t)  ^  Prob{  hbpf  =  t  } 


00 

f 

fi  successive  in-  \ 

total  length  of  \  ' 

=  E 

Prob. 

page  operations  v  »Prob< 

i  in-page  operations^ 

i=l 

N 

[before  a  page  faulty 

.=  t  1J 

(3.4.3) 


The  probability  that  i  in-page  operations  take  place  before  a  page 
fault  occurs  is  given  by  a  geometric  distribution  with  a  parameter  of 
p,  as  is  clear  from  the  assumption  (3).  The  sum  of  i  exponentially 
distributed  (identical)  variables  is  known  to  follow  an  Erlang  distri¬ 
bution  of  phase  i,  i.e.,  the  i-fold  convolution  of  identical  exponential 
distributions  [Fl ] .  Therefore,  Eq.  (3.4.3)  becomes 


The  value  of  the  effective  slow-down  factor  due  to  memory  cycle  inter¬ 


ference  (K  )  for  a  given  configuration  will  be  calculated  using  the  value 
m 

of  Y. 
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Thus,  hbpf  has  been  proved  to  follow  an  exponential  distribution  of  a 
mean  of  l/aQp.  This  means  that  the  sum  of  hbpf  (O  and  the  miscella¬ 
neous  overhead  time  (tj  approximately  follows  an  exponential  distri¬ 
bution  with  a  mean  of  (l+fi)/a0p.  Noting  that  the  paging  overhead  time 
is  considerably  shorter  than  the  sum  of  hbpf  and  t^  we  can  expect 
that  tbpf  roughly  follows 

f(t)  =  a  e  "a  t  t  5  0 

where 

l  _  —  —  ^  1+6  _  1 

mtbp f 2  -  — =  <1+6)tul+KJ&tpl  aQp  "axP 

Thus,  we  have  seen  that  the  model  analyzed  in  Section  3.  3  can  be  regard 
ed  as  a  special  case  of  the  model  developed  in  this  section  (see 
Eq.  (3.3.1));  the  previous  model  actually  corresponds  to  a  (q;  1,  n) 
configuration  system  with  the  above  formulation. 

Under  these  assumptions,  each  eligible  user  process  cycles  around 
the  three  states,  i.e.,  the  page-ready  state,  the  running  state,  and 
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the  page-wait  state,  as  shown  in  Figure  3-4.  It  should  be  noted  that 
if  two  processors  happen  to  use  the  same  primary  memory  unit  for  which 
Y^l  then  the  rate  =aQ/(l+6 )  of  Figure  3-4  must  be  replaced  by  a  /y. 

It  is  important  to  note  that  non-resident  programs  (e.g.,  user  programs) 
and  resident  programs  (e.g.,  fault  handlers)  are  treated  as  if  they  are 
equally  susceptible  of  j age  faults  in  this  formulation.  This  formula¬ 
tion  was  adopted  for  the  sake  of  simplicity  of  the  analysis.  Finally, 
we  will  finish  this  section  by  defining  some  terminology  to  be  used  in 
the  following  analysis. 

Multiprogramming  idle  time:  the  processor  time  lost  when  process¬ 
ors  do  not  have  executable  eligible  user  jobs,  i.e.,  when  at  least 
q  -  1  jobs  are  simultaneously  in  the  page-wait  state. 

Memory  interference  idle  time;  the  processor  time  lost  due  to  the 
unavailability  of  memory  cycles  when  two  processors  simultaneously 
access  the  same  unit  of  primary  memory  (memory  cycle  interference). 

Data-base  lockout  idle  time:  the  rocessor  time  lost  due  to  the 
unavailability  of  certain  shared  writable  data-bases  (data-base 
lockout) . 


Besides  these  various  idle  times, we  lose  a  non-negligible  fraction  of 
the  system's  computational  capacity  because  of  paging  overhead  and 
miscellaneous  overhead  of  the  supervisor. 


Figure  3-4  Behavior  of  an  Eligible  Process  (Y  =  l) 
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JL4.2.  Multiprogramming  with  FCFS  Scheduling 

We  will  now  evaluate  the  system  throughput  of  the  (q;  2,  n)  con¬ 
figuration  system  as  well  as  the  multiprogramming  idle  time,  the  memory 
interference  idle  time,  the  date-oase  lockout  idle  time,  and  other 
capacities  wasted  as  various  system  overheads,  under  the  assumptions 
stated  in  the  previous  section. 

In  analyzing  the  behavior  of  the  (q;  2,  n)  configuration  system,  we 

should  first  note  that  a  state  of  the  system  is  characterized  by  the 

number  (i)  of  page-ready  processes  (waiting  in  Q^),  the  number  (j)  of 

running  processes  (using  the  CPUs)  the  number  (k)  of  page-wait  procetaes 

(associated  with  the  SM),  and  a  variable  (s)  which  indicates  the  existence 

of  memory  cycle  interference  (s  =  1  if  interference  exists  and  s=0 

otherwise).  Then,  it  follows  that  each  (i,j,k,s)  tuple  defines  a 
t  k  \ 

state(8Sj  )of  the  system,  as  shown  in  Table  3-4.  The  number  of 
states  thus  defined  is  found  to  be  2q.  We  see  that  these  states  are 
mutually  related  by  the  transitions  with  transition  rates  indicated  in 
Figure  3-5,  and  therefore  the  behavior  of  the  system  can  be  modeled  by 
a  continuous  time  Markov  process  defined  over  these  states. 

Let  A  be  the  steady-state  probability  of  S*  Then,  the  steady- 
state  probabilities  n  can  be  obtained  by  solving  the  balance  equation 
it'AsrO,  i.e.,  the  following  set  of  simultaneous  equations  given  by 
Eq.  (3.4.6)  through  Eq.  (3.4.14), 
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2alp(n"1).  k-l%.  2alp("’1)  ,  k.j  aa1(l-p)(n-l)  ^  ^ .  ^  k+lv 

n  (On2  )+  nY  (ln2  )+  nY  (^J+O^+DPCq^  ) 


(kp  + 


2axp  2o^p(n-l)  k 


+  n  ^ 


)(0n2)  l^ksq-3 


(3.4.6) 


*lp  k-1. .  2aip  x-l  2al(1'p)  ,  k 


nY  lrt2  )+  n  (On2  )+‘ 


—  (0itp+(k-l)P(1n2’‘) 


2a, (l-p)(n-l)  2a,p(n-l)  2a,p  . 

(kp+^-___ -  +  _^__  +  _J_)(ink)  lsksq.3 


(3.4.7) 


Left  Boundary  Conditions 

2a, (1-p) (n-1) 

A  U  \  i  A  / 


(iV+^oV^  « 


i.  .2ai(l'P)  ,2alP  2alP(n-l)  0 

2)n(  _  +  — T  -  )  (0n2) 


(3.4.8) 


2al‘1-p),  0 _  1.  2a|(l-p)(n-l)  Zoj  p(n-l)  2o^P  ^  _  „ 


—  <0«p  +  8  (,«-)-( 


Right  Boundary  Conditions 


(3.4.9) 


2a1p(n-l)  «  2a^p(n-l)  2al(l-p)(n-l)  ^  (q-l)(p-Op^ 

^nno  7+  : 


n  '0  2 


V— <0^  \ 


2a1(i-p) 


((q-2)P  +  — ±_ -  +  2alP)  (QnJ  ‘) 


(3.4.10) 


20!1P  q-3  ^lP  q-l  2ai<l-P>  q-2  (q_1)P  q-1 

“7~(i«5  ^+-7T(0«2  *+“ 4 - (0«2  )+— —  (oni  > 

2a  (l-p)(n-l)  2a  p  , 

((q-2)p  > 


(3.4.11) 


2aip(o<ir^+— v~  <i»r2> + ^(0'o> = < 


o”?'1) 

(3.4.12) 


aip(oni"l)  “  qP(ono) 


(3.4.13) 
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Normalization  Conditon 


^0n0^ 


g<„ 


k, 


q-2 

+  Z  ( 

k=0 


k. 

l’V  = 


(3.4.14) 


Because  there  are  2q  independent  equations  in  2q  unknown  variables,  the 

k  i  . 

snj  s  can  be  uniquely  determined. 

Now  we  are  ready  to  express  several  interesting  performance  measures 
of  the  system's  computation  .1  capacity  in  terms  of  the  steady-state 
probabilities  obtained  above.  It  is  assumed  that  the  system's  computa¬ 
tional  capacity  (2C),  iroasured  in  the  total  number  of  instructions  that 
can  be  performed  by  t  e  system's  two  processors  under  the  ideal  condi¬ 
tion,  breaks  down  ir  to  the  capacity  (C^)  wasted  as  multiprogramming 
idle  time,  the  cap  jcity  (C^)  wasted  as  memory  interference  idle  time, 
the  capacity  (Cp  *  wasted  as  the  paging  overhead  including  the  capacity 

<Cdi>  wasted  as  data-base  lockout  idle  time,  the  ~.apacity  (C  )  wasted 

m 

as  the  miscell  ineous  overhead,  and  the  rest,  i.e.,  the  capacity  (C  ) 
used  as  users  useful  work.  These  capacities  are  evaluated  as  follows: 

Capacity  wasted  as  multiprogramming  idle  time 

Cmi  -  2C  <0"S>  +  C  <0-r‘> 

=  f2<0n0>  +<o,,l"1|^C  (3.4.15) 
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Capacity  wasted  as  memory  interference  idle  time 


q-2 


Cii  =  ^  V  ** 


2C 


k=0 


k=0 


Capacity  wasted  as  the  paging  overhead 


J>2_ 


Cp  =  (2C  -  Cmi-  Cii)*  mtbpf  (  2  CPI'S  ) 
=  (2C  ‘  Cmi”  Cii)*  mtbpf. 


(3.4.16) 


-  (2C  '  Cmi‘  Cii),aK£tPl' 
where  “  =mtbpf2  =  (1+6)^ui  +K/pl  =  ^ 


(3.4.17) 


Cdi  =  (  KjT  1  CP 


(3.4.18) 


Capacity  wasted  as  the  miscellaneous  overhead 


C 

m 


(2C  - 


C  . 
mi 


Cii>’ 


mtbpf  (  2  CPUs  ) 


=  (2C  - 


C 

mi 


C.  .) 

li 


mtbpf2 


=  (2C  -  c  c..)*a6  t 

mi  ii  ui 


(3.4.19) 


152 


Capacity  used  as  users’  useful  work 


C  =  (2C  -  C  -  c  ).— - - — _ 

mi  ii  mtbpf ( 2  CPUs  ) 


=  (2C  -  C  C. .) 

mi  ii' 


mtbpf 


’  <2°  '  Cml”  Cii>,arul 
Meraory  interference  slow-down 


(3.4.20) 


<  LV  (°n2>  +Y  (lfl2)  ]  ]  +  T  (0nl_1) 
[kf0  ^  (0Jr2)  +  (lIt2)  *  1  +T  Vl"^ 


Percentile  throu2hpnf 


8  =  u  • 


mtbpf  (  2  CPUs  ) 


2C-C.-C.  t 
_ mi  li  ui 

2C  mtbpf 


(3.4.21) 


2C  (3.4.22) 

It  is  seen  from  Eq.  (3.4.22)  that  the  percentile  throughput  of  the 
(q;2,n)  configuration  system  is  expressed  as  a  function  of  user 
program  characteristics  (t^p),  the  system  overhead  structure  (t  p6,K 
and  hardware  characteristics  (n,0,Y).  The  system  throughput  0  o/this  ^ 
configuration  can  be  obtained  by  substituting  Eq.  (3.4.22)  into 
Eq.  (3.2.8). 

An  expression  for  gJtJ  is  generally  too  complicated  to  write  out 
and  therefore  the  expressions  given  by  Eq.  (3.4.15)  through  Eq.  (3.4.22) 
can  only  be  numerically  evaluated  (this  will  be  done  in  Chapter  5), 
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except  for  some  simple  cases.  However,  one  of  those  simple  cases  for 
which  xk  is  not  too  complicated  may  be  worth  being  explicitly  analyzed 

s  j 

as  an  illustrative  example. 


Example  A  (2;2,n)  Configuration  System 

The  state  transition  diagram  of  Figure  3-5  now  reduces  to  the  four 
state  transition  diagram  given  below. 


The  equations  given  by  Eq.  (3.4.6)  through  Eq.  (3.4.14)  correspondingly 
reduce  to: 

<o”2> 

2a,  (1-p)  n  r  1  2a1(l-p)(n-l)  2axP  0 

-V-V2>+i(0"!>  “  - +— !  <l’t2) 

2a  p  n  7  l 

2alp  +~Y~  (1IT2)  +2p  (oV  =  (  P+alp)  (0nl) 


alp  =  2p 

(0*2}  +  +  (0*1}  +  <0*S>  =  1 
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Solving  these  equations,  we  get 


o*o 


ncr 


n(l-KT;  +  Y  -1 


1 

0*1 


_ 2nO _ 

n(l+cr)2+ Y  -1 


0  n-1 

0^2  =  2 

u  n(l+a)4vl 

0  Y 

lrt2  =  2 

n(l+a)>  Y-  1 


where  a  =axp/p  =  a/p. 

Thus,  from  Eqs.  (3.4.15)  and  (3.4.16),  we  obtain  the  following 
result  concerning  the  capacities  wasted  as  multiprogramming  idle  time 
and  as  memory  interference  idle  time. 


C  . 
mi 


-■?  (12+g>  *2C  .  -  -2C 

n(l+CT)+Y-l  (l+a)+(-J-^) 

n 


C.  . 

ii 


Y-l 


■•2C 


(Ir> 


n(l+CT)+Y-l  (1+CT  )2+(^) 


•2C 


The  system's  computational  capacities  used  as  various  overheads  (C  C  ) 

p’  nr 

and  as  users'  useful  work  (Cu)  respectively  follow  from  Eqs.  (3.4.17), 
(3.4.19),  and  (3.4.20).  The  memory  interference  slow-down  factor  is 
obtained  from  Eq.  (3.4.21). 

_  (n-l)+ Y2+ng 
m  (n-1)  +  Y  +  ncr 


Finally,  the  percentile  throughput  is  obtained  from  Eq.  (3.4.22)  as 
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e 


l+CT 


(l+a)2  +  ("■) 


•at 


u  1 


It  should  be  noted  that  as  (Y-l)/n  approaches  zero  (either  n-®,i.e., 
an  infinite  number  of  memory  units,  or  y  -*1,  i.e.,  no  memory  cycle 
interference)  the  effect  .of  memory  cycle  interference  disappears. 
Furthermore,  the  multiprogramming  (processor)  idle  probability  of  this 
dual  processor  system  under  this  limit  is  Cm^/2C  =  Ct/(1-Kt).  Although 
the  multiprogramming  (processor)  idle  probability  of  a  (1;  1,  n)  con¬ 
figuration  system  was  also  found  to  be  or/(l-K7)  from  Eq.  (3.3.5)  of  the 
previous  section,  the  split  system  (q;  1,  n)  generally  gives  a  larger 
multiprogramming  idle  probability  than  lue  (combined)  dual  processor 
(2q;  2,  n)  configuration  system,  under  the  no  memory  interference 
condition  mentioned  above.  Needless  to  say,  this  does  not  necessarily 
mean  that  the  (2q;  2,  n)  system  is  morr  efficient  than  the  (q;  2,  n) 
system;  the  comparison  must  be  made  in  terms  of  perfcentile  throughput 
rather  than  multiprogramming  idle  probability. 
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3.5.  Some  Extensions 

We  have  been  concerned  with  the  throughput  of  the  multi-processor 
multi-memory  processing  system  under  multiprogramming.  In  particular, 
we  have  presented  an  analytical  method  to  evaluate  the  throughput  of  a 
single  processor  and  a  dual  processor  systems  under  multiprogramming. 

In  this  section,  we  will  give  brief  remarks  on  some  extensions  of  the 
results  obtained  in  this  chapter. 

The  ; tudy  of  the  system  throughput  of  this  chapter  has  been  mainly 
confined  to  the  FCFS  scheduling  model;  a  brief  investigation  of  the  PP 
(preemptive  priority)  scheduling  model  for  a  single  processor  system 
left  the  value  of  the  mtbpf  of  each  process  with  a  distinct  priority 
unspecified.  It  is  perhaps  possible  to  determine  the  values  of  these 
n  tbpf  s  and  then  evaluate  the  throughput  of  such  a  model,  by  assuming 
that  the  processor  time  breakdown  among  eligible  user  processes  is 
proportional  to  the  primary  memory  space  usage  breakdown  among  them. 
Then ,  it  would  be  possible  to  evaluate  the  performance  gain  of  the  PP 
scheduling  model  over  the  FCFS  scheduling  model  quantitatively,  using 
the  macroscopic  paging  performance  model  of  Section  2.5.  As  for  the 
dual  processor  system,  even  the  analysis  of  the  FCFS  discipline  model 
was  rather  complicated.  Inclusion  of  the  PP  scheduling  would  increase 
the  number  of  Markovian  states  of  this  model  and  necessarily  complicate 
its  analysis.  Therefore,  no  attempt  to  study  the  dual  processor  PP 
scheduling  model  is  made  in  this  thesis. 

Processing  systems  involving  more  than  three  processors  have  not 
been  considered.  In  order  to  examine  the  complexity  of  such  systems, 
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let  us  consider  a  general  (q;m,n)  configuration  system.  Speaking  of 
memory  cycle  interference  alone,  there  exist  many  kinds  of  memory  cycle 
interferences  on  this  system,  such  as  interference  between  two  process¬ 
ors,  that  among  three  processors,...,  that  among  m  processors. 

Therefore,  we  must  define  a  different  V  (memory  interference  coefficient) 
for  each  kind  of  memory  cycle  interference.  Such  a  complicated  formula¬ 
tion  of  the  problem  5  however,  tends  to  reduce  the  mathematical  trar.tabil- 
ity,  and  therefore  it  seems  necessary  to  consider  the  effect  of  memory 
cycle  interference  indirectly  much  like  the  effect  of  data-base  lockout 
on  the  (q;  2,  n)  configuration  system  was  considered  in  this  chapter. 

Finally,  let  us  briefly  consider  the  optimization  of  the  multi¬ 
processor  multi-memory  processing  system  under  multiprogramming,  with 
respect  to  the  degree  of  multiprogramming.  The  method  of  evaluating  the 
system  throughput  developed  in  this  chapter  enables  us  to  investigate 
the  effect  of  the  degree  (q)  of  multiprogramming  upon  the  system  through¬ 
put  of  a  given  hardware  configuration.  If  the  system  allows  only  a  few 
eligible  user  processes  the  percentile  throughput  would  be  small  because 
of  relatively  large  multiprogramming  idle  time.  On  the  contrary,  if  the 
system  allows  too  many  eligible  user  processes  the  percentile  throughput 
would  be  again  small  because  of  relatively  frequent  paging  overhead 
execution.  Therefore,  we  expect  a  modal  curve  for  the  percentile 
throughput  like  the  one  shown  in  Figure  3-6;  the  percentile  throughput 
is  maximized  when  the  degree  of  multiprogramming  has  a  certain  value 
(q*) .  This  optimization  will  be  carried  out  numerically  in  Chapter  5, 
and  therefore  detailes  will  not  be  given  in  this  section. 


degree  of 
multiprogramming 


Figure  3- 


Percentile  Throughput  as  a  Function 
of  the  Degree  of  Multiprogramming 


CHAPTER  4 


RESPONSE  TIME  ANALYSIS 


4.1.  Introduction 

Now  that  a  method  to  evaluate  the  throughput  (or  the  percentile 
throughput)  of  a  multiprogrammed  virtual-memory  computer  system  of  a 
given  multi-processor  multi-memory  configuration  has  been  developed,  we 
shall  proceed  in  this  chapter  to  evaluate  the  response  time  experienced 
by  interactive  users  of  such  a  time-shared  computer  system.  From  a 
system  manager's  viewpoint  efficient  utilization  of  system  resources, 
or,  more  concretely,  the  maximization  of  the  system  throughput  is  most 
important.  On  the  other  hand,  service  of  good  quality  such  as  fast 
system  response  or  high  system  reliability  is  most  desirable  from  a 
user's  viewpoint.  As  will  be  described  in  this  chapter,  the  system 
response  time  and  the  system  throughput  are  closely  related.  In  fact, 
the  response  time  analysis  of  this  chapter  is  firmly  based  on  the 
system's  percentile  throughput  that  was  evaluated  by  the  throughput 
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analysis  of  Chanter  3. 

It  will  be  seen  that  the  maximization  of  the  system  throughput 
generally  leads  to  the  minimization  of  the  system  response  time. 
Therefore,  the  computer  system  under  study  is  assumed  to  be  optimized 
with  respect  to  the  degree  (q)  of  multiprogramming;  i.e._.  the  system  is 
under  multiprogramming  of  degree  q*  (q=  q*),  where  the  percentile 
throughput  (or  system  throughput)  of  the  system  of  a  given  configuration 
is  maximized  by  the  throughput  analysis  of  Chapter  3  when  q  is  equal  to 
q*  (see  Figure  3-6).  It  is  clear  that  if  interactive  users  impose  a 
full  load  on  the  system  the  percentile  throughput  of  the  system  (or 
system  throughput)  is  independent  of  the  actual  number  of  users  (because 
of  the  load  leveling  of  the  multiprogramming  mechanism). 

On  the  other  hand,  the  response  time  is  clearly  an  increasing 
function  of  the  number  of  users.  Therefore,  one  may  ask  how  many 
interactive  users  can  be  supported  by  such  an  optimized  computer  system 
without  discouraging  them  with  excessively  slow  system  responses. 

Scherr  [S4]  considered  the  problem  of  analyzing  the  oerformance  of 
a  CTSS-like  computer  system  and  proposed  a  simple  queuing  model  to 
predict  the  average  response  time.  Because  the  computer  system  under 
study  in  this  thesis  is  much  more  sophisticated,  the  model  must  reflect 
those  sophisticated  features  of  the  system  such  as  paging,  multi¬ 
programming,  etc.  Therefore,  we  will  extend  the  queuing  model  used  in 
Scherr' s  analysis  in  the  following  two  directions.  First,  his  model 
(developed  for  uniprogrammed  non-paging  computer  systems  like  the  CTSS 
system  [C 9 J )  will  be  generalized  to  multiprog rammed  paging  computer 
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systems.  Second,  in  view  of  the  fact  that  interactive  terminal  users 
are  more  concerned  with  something  like  the  worst  response  time  out  of 
ten  trials  rather  than  the  average  response  time,  the  analysis  to  be 
given  in  this  chapter  will  derive  a  probability  distribution  of  response 
times.  (Scherr  derived  only  the  average  response  time  of  a  simpler 
system.)  Thus,  we  can  evaluate  the  moment  of  any  order  (average, 
variance,  etc)  of  response  times,  and,  perhaps  more  importantly,  we  can 
derive  the  percentile  system  response  time,  i.e.,  the  time  limit  which 
guarantees  that  a  certain  proportion  of  response  tines  is  shorter  than 
this  limit  (e.g.,  the  90  percentile  system  response  time). 

The  analysis  of  this  chapter  makes  it  possible  to  express  the 
distribution  of  the  response  time  of  a  multiprogrammed  time-shared 
virtual -memory  computer  system  of  a  given  configuration,  as  a  function 
of  various  system  parameters  such  as 

(1)  the  number  of  processors 

(2)  the  percentile  throughput  0  of  the  processing  system 

(3)  the  number  of  (interactive)  users 

(4)  think  time  of  each  user 

(5)  execution  time  required  by  each  user's  job 

This  implies  that  it  now  becomes  possible  to  determine  the  number  of 
interactive  terminal  users  that  can  be  supported  by  the  optimized 
computer  system  of  a  given  hardware  configuration  with  the  assurance 
that,  for  example,  the  90  percentile  system  response  time  is  shorter 


than  five  seconds. 
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4.2.  Total  System  Model 

Let  us  consider  an  m  (1  processor  multiprogrammed  virtual- 

memory  computer  system  optimized  with  respect  to  the  degree  of  multi¬ 
programming,  with  N  interactive  terminal  users.  When  this  system  is  in 
operation,  the  existence  of  N  interactive  users  however  does  not  assure 
the  existence  of  q*  (q*=  optimum  degree  of  multiprogramming)  jobs  that 
can  be  multiprogrammed  on  the  system.  The  actual  number  (q')  oi  jobs 
(user  processes)  under  multiprogramming  may  become  smaller  than  q*  from 
time  to  time  if  the  total  number  (N)  of  interactive  users  is  not  large 
enough  to  impose  a  full  load  upon  the  system,  as  shown  in  Figure  4-1  (a). 
Correspondingly,  the  actual  percentile  throughput  (0')  should  become 
smaller  than  0^  (0^=  optimized  percentile  throughput)  at  least  from 
time  to  time,  under  the  same  loading  condition.  However,  we  do  not 
know  exactly  how  0'  reacts  to  a  sudden  change  of  q\  If  0'  can  almost 
instantaneously  react  to  a  change  of  q'  (the  case  of  fast  throughput 
reaction) .  then  6'  as  a  function  of  time  appears  like  a  step-wise 
function  for  which  the  height  of  each  step  is  simply  given  by  the  steady- 
state  solution  for  0^  (the  percentile  throughput  when  there  exist  q 
eligible  user  processes)  obtained  in  Chapter  3,  as  shown  by  the  solid 
line  in  Figure  4-1  (b) ;  otherwise,  0’  should  show  a  humping  curve  as 
shown  by  the  broken  line  beside  the  step-wise  solid  line  in  the  same 
figure.  If  the  reaction  of  0'  is  very  slow  (the  case  of  slow  through¬ 
put  reaction) .  ©'  may  be  reasonably  approximated  by  a  constant  function 
[S6J,  as  shown  by  the  horizontal  line  in  Figure  4-1  (b)  ;  the  value  (height) 
of  this  function  may  be  given  by  the  average  of  the  percentile  through- 


(a)  Number  of  User  Pr  icesses  under  Multiprogramming 


—  ■  —  fa  t  throughput  reaction 

-  moc 2rate  throughput  reaction 

- s lo  *  throughput  reaction 


(b)  Percentile  Throug  put 


Figure  4-1  Sample  Behavior  of  a  Typical  Computer  System 
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put  of  the  system  in  operation,  which  we  call  the  effective  percentile 
throughput  9(N).  That  is  to  say, 

w  =  {(311"iei)+(1yIv)}/<1-v  (4-2-l) 

where  n ^  is  th  •  probability  that  i  user  jobs  are  either  waiting  or  being 

serviced  on  the  ictual  processing  system,  is  the  probability  that  no 

jobs  are  being  serviced  (i.e.,  the  system  is  completely  idle),  and  0. 

(i=l,2, . . .,q*, . . . )  is  the  percentile  throughput  of  the  system  under 

multiprogramming  of  degree  q  =i  .  It  should  be  noted  that  n.  is  a 

1 

function  of  N  (the  number  of  users)  as  well  as  of  other  system  parameters. 
This  means  that  the  effective  percentile  throughput  of  the  system  under 
multiprogramming  is  dependent  on  the  number  of  users  unlike  that  of  the 
system  under  uniprogramming.  (0(N)  will  be  often  denoted  simply  as  0 
hereafter. ) 

We  shall  approximate  the  behavior  of  a  multi-processor  processing 
system  of  such  an  actual  computer  system  with  that  of  a  hypothetical 
multi-processor  processing  system  of  Figure  4-2,  for  which  the  system's 
computational  capacity  can  be  fully  utilized  for  users'  useful  work, 
but  its  execution  speed  is  0  (Os0si)  times  as  slow  as  the  actual 
processing  system.  We  assume  that  each  processor  of  the  hypothetical 
system  services  user  jobs  one  by  one  up  to  completion,  under  the  FCFS 
scheduling  discipline.  A  discussion  on  the  possible  modeling  errors 
will  be  collectively  given  later  in  this  chapter  and  therefore  that 
is  not  our  current  concern. 

Finally,  in  order  to  characterize  the  behavior  of  interactive 


Actual  System  Hypothetical  System 


2 

throughput  8  x  10  $  100  ^ 

speed  1  q 


Figure  4-2  Tradeoff  of  Execution  Speed  and 
Percentile  Throughput 
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terminal  users,  we  assume  the  following  : 

(1)  A  user's  think  time  (Tfc),  i.e.,  the  length  of  time  required  by 
an  interactive  user  to  decide  and  request  the  next  job  from  a 

terminal,  independently  follows  an  exponential  distribution  with  a 
mean  of  T  =  l/jj. 

"M.  t 

f  (t)  =  \i  e  t&o  (4.2.2) 

(2)  The  execution  time  (Te>  of  each  user's  job  (excluding  all  the 
system  overhead  times  to  be  added)  independently  follows  an  expo¬ 
nential  distribution  with  a  mean  of  Te-1A  ,  on  the  actual  multi¬ 
processor  computer  system. 

-X  t 

g0(O  =  \je  t^O  (4.2.3) 

Available  measurement  results  of  actual  computer  systems[e.g.,  S4] 
indicate  that  both  of  these  distributions  are  roughly  exponential^. 
Therefore,  these  two  assumptions  seem  to  be  fairly  reasonable.  The 
second  assumption  obviously  implies  that  the  execution  time  of  each 
user's  job,  when  executed  on  the  hypothetical  system,  follows  an  expo¬ 
nential  distribution  with  a  mean  of  1/0=  1/QXq  s  l/\.  That  is  to  say, 

g(t)  =  le‘U  t^O  A 

where  X  =  9  XQ  J  (4.2.4) 


The  eiiect  of  non-exponential  distributions  upon  the  response  time 
characteristics  of  a  similar  model  is  considered  by  Jaiswal[J2]  and 
D'Avanzo [D4 ] . 
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Thus,  the  behavior  of  an  actual  multiprogrammed  virtual-memory 
computer  system  depicted  in  Figure  1-1  is  now  approximated  with  that  ol 
the  hypothetical  system,  called  the  total  system  model,  like  the  one 

shown  in  Figure  4-3. 


Figure  4-3  Scherr's  Model  for  Uniprogrammed  Non- Paging  Time-Shared 
Computer  Systems 
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4.3.  Analysis  of  the  Model  Behavior 

It  should  be  noted  that  the  total  system  model  developed  in  the 
previous  section  is  basically  the  multi-server  machine  repairman  model 
or  an  M/M/S1  queue  with  a  finite  population  [C8,F1,J2].  We  shall 
proceed  to  derive  a  response  time  distribution  of  the  computer  system 
under  study  using  the  total  system  model,  step  by  step,  in  this  section 


4.3.1.  Derivation  of  Queue  Length  Distribution 

As  the  first  step  in  deriving  a  response  time  distribution,  this 
section  is  concerned  with  the  derivation  of  a  distribution  of  the  £ueue 
length,  i.e.,  the  number  of  user  jobs  either  waiting  or  being  serviced 

(executed)  on  the  processing  system. 

Noting,  as  before,  that  a  state  of  the  model  is  characterized  by 

the  number  of  interactive  users  currently  thinking,  the  number  of  inter¬ 
active  users  (jobs)  queuing  in  front  of  the  hypothetical  processing 
system,  and  the  number  of  interactive  users  (jobs)  being  serviced  by 
the  processors  (see  Figure  4-3),  let  St  <0*i*N)  denote  a  state  of  the 
model  where  i  user  jobs  requested  by  the  corresponding  i  interactive 
users  are  either  being  serviced  or  waiting  for  their  turn  to  be  serviced 


1The  first,  second,  and  third  components  of  this  notation  specify 
respectively  the  type  of  the  arrival  process,  the  type  of  the  service 
process,  and  the  number  of  servers  of  the  queuing  system.  In  this  case, 
both  the  arrival  and  service  processes  are  Markovian  (i.e.,  exponential) 

and  the  queuing  system  has  £5  servers. 
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on  the  processing  system,  i.e.,  the  queue  length  is  i.  (We  assume  N  to 
be  constant  and  that  the  service  for  one  of  the  waiting  user  jobs  starts 
as  soon  as  a  processor  becomes  available.)  Then,  the  behavior  of  this 
stochastic  process  can  be  schematically  described  by  a  state  transition 
diagram  shown  in  Figure  4-4.  Because  the  effective  execution  speed  of 
the  processors  depends  on  the  value  of  0'  (the  "actual"  percentile 
throughput)  at  that  moment,  the  rates  (the  rate  with  which  one  of  the 
user  processes  on  the  processing  system  completes  its  execution  when 
there  exist  1  (l^i^N)  user  processes  on  the  processing  system)  depends 
on  how  0'  reacts  to  the  changes  of  the  actual  number  (q1)  of  user 
processes  under  multiprogramming.  Therefore,  the  rates  X.^  are  specified 
for  two  cases,  i.e.,  the  case  of  fast  throughput  reaction  and  that  of 
slow  throughput  reaction,  in  Figure  4-4. 

Now  let  (O^i^N)  be  the  steady-state  probability  of  S^.  Then, 
the  steady-state  probabilities  can  be  obtained  [FI]  by  solving 

(N-i)|i«i  s  ^i+l^i+l  O^iSN-1  (4.3.1) 


N 

under  the  condition  Hit.  si,  Thus,  it.  is  obtained  by  recursion,  for  the 

i=0  1 


case  of  slow  throughput  reaction,  as  follows: 


'  /Nx  i 
=  <  (j)  P  «0 


N! 


v  m!  (N-i)!  i-m,ir0 
m 


l^i^n 


mSi^N 


where 


p  =  X  ~  0X, 


0  0T. 


*0  = 


N! 


m-i  i  N 

A(i)P  +  .S  m!(N-i)!'  i-m 
.  i=U  i=m  m 


i  ’-1 


V  (4.3.2) 
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Figure  4-4  State  Transition  Diagram  of  the 

Behavior  of  the  Total  System  Model 
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Note  that  it ^  is  the  probability  that  one  finds  i  user  jobs  either  wait¬ 
ing  or  being  serviced  on  the  processing  system  at  a  randomly  chosen 

instant  after  the  system  has  operated  for  a  long  time.  Therefore  it 

}  i 

gives  a  steady-state  probability  distribution  of  the  queue  length  found 
at  a  randomly  chosen  instant. 

The  analysis  of  the  case  of  fast  throughput  reaction  is  exactly 
analogous  to  that  of  the  above  case,  and  therefore  we  will  consider 
only  the  case  of  slow  throughput  reaction  hereafter. 

w  let  P^  (OSiiN)  be  the  steady-state  probability  that  an 
arrlvln8  user  job  (one  of  the  N  users)  finds  i  jobs  either  waiting  or 
being  serviced  on  the  processing  system  at  the  instant  it  joined  the 
job  queue  Q.  In  other  words,  is  the  distribution  of  the  queue  length 
an  arrl-vln8  instant  of  a  user  job.  Then,  noting  that  the  expected 
number  of  arriving  users  who  find  exactly  i  user  jobs  ahead  of  themselves, 
during  a  duration  of  time  T,  is  given  by  (N  -  i)|a  it  i  •  T,  we  find  that  P1 
is  expressed  as 


(N-i)|i  it,  T 


2  (N-j)nit.  T 
j=0  J 

N 

where  Q  =  £  j  it . 

j=0  J 


N-i 

-_.it. 

N-Q 


\ 


> 


J 


(4.3.3) 


It  should  be  emphasized  that  Pi  is  different  from  it  in  nature;  for 
example,  PN  is  zero  because  no  user  can  request  a  new  job  when  jobs  of 
all  interactive  users  are  either  waiting  or  being  serviced  on  the 
processing  system,  while  it^  if  obviously  non-zero. 
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4.3.2.  Derivation  of  Effective  Percentile  Throughput 

Now  it  should  be  noted  that  the  queue  length  distributions,  it  and 
p.,  of  the  previous  section  involve  0(N),  as  clear  from  Eq.  (4.3.2), 
whose  value  is  still  unknown  because  ru  is  not  known  (see  Eq.  (4.2.1)). 
Therefore,  we  proceed  to  determine  the  value  of  0(N)  in  this  section. 

It  is  reasonable  to  expect  that  (of  the  hypothetical  system) 
gives  a  fair  estimate  of  jk  (of  the  actual  system)  if  0(N)  can  be 
accurately  estimated.  This  argument  suggests  that  one  solve  Eq.  (4.3.2) 
and 


,<*•1  , 

/  N 

i  / 

.  ( ^  ej  + 

Lvi)( <’ 

»}/ “-V 

(4.3.4) 

simultaneously.  (k  of  Eq.  (4.2.1)  was  replaced  by  j^,  in  Eq.  (4.3.4).) 

Therefore,  we  must  solve  these  non-linear  simultaneous  equations. 

However,  if  we  note  that  any  distribution  of  rr ^  obtained  from  Eq.  (4.3.2), 

assuming  a  certain  value  fo  ■  9(N),  gives  an  estimate  of  0(N)  which  is 

bounded  by  6,  and  0_^,  then  we  find  that  the  value  of  0(N)  may  be 
"  X  CJ* 

iteratively  determined,  as  flollows. 


^The  analysis  of  this  section  is  required  only  for  the  case  of  slow 
throughput  reaction. 
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Procedure  to  determine  the  value  of  6(N) 


A 


Step 

1 

Set  the  variable  9  to  9  *,  i.e.,  0=  0  *. 

q*  '  q 

Step 

2 

A 

Assuming  that  0(N)  =  9,  evaluate  using  Eq.  (4.3.2). 

A 

(;q*"1  ,  ,  N  i  w 

Step 

3 

Compute  9  = 

U  ”iei!+  E  ”i)Vl/<1'V- 

(.1=1  i=q*  '  4  )• 

Step 

4 

If  |©(N)  -  0 

|  Sc,  then  go  to  the  next  step.  Otherwise, 

return  to  Step  2.  (The  value  of  e  should  be  small  enough 
to  assure  the  desired  accuracy  of  the  estimation.) 

A 

Step  5  6  thus  obtained  is  an  estimate  of  6(N). 


The  initial  overestimation  of  0(N)  in  step  1  leads  to  the  under- 
N 

estimation  of  £  it.  in  step  2,  which  in  turn  leads  to  the  underestimation 
i=q* 

of  9(N)  in  step  3.  (See  Figure  4-5)  The  condition  of  step  4  is  usually 

not  satisfied  and  therefore  we  must  return  to  step  2.  The  underestima- 

N 

tion  of  0(N)  then  leads  to  the  overestimation  of  £  jt  in  step  2  this 

i=q*  1 

time,  which  in  turn  leads  to  the  overestimation  of  0(N)  in  step  3.  But 
the  overestimated  value  of  0(N)  is  certainly  smaller  (closer  to  the  0(N) 
being  estimated)  than  the  initially  overestimated  value  of  0(N),  i.e., 

©q*-  Repeating  this  kind  of  oscillation  around  the  value  being  estimated, 
as  shown  in  Figure  4-5,  the  value  of  0(N)  can  be  determined  by  the 
above  iterative  procedure.  In  deriving  a  numerical  example  to  be 
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L 


Figure  4-5  Iterative  Estimation  of  Effective  Percentile  Throughput 


176 


Mven  in  Chapter  5,  it  was  observed  that  0  converges  within  several 

iterations  when  e  was  chosen  to  be  about  0.2  percent  of  0(N).  If  the 

number  of  users  (N)  is  large  enough  to  impose  a  heavy  load  upon  the 
N 

system,  thenj^-  1  and  0-0^.  Therefore,  the  above  procedure 
converges  without  need  for  iterations.  It  was  furthermore  observed 
that  the  use  of  (rather  than  0^)  as  the  initial  estimate  for  0(N) 
in  step  1  leads  to  the  convergence  to  the  same  value  of  6.  Therefore, 

1  r  ir*  felt  that  this  iterative  procedure  uniquely  determines  the  value 
of  0(N )  . 


4.3.3.  Derivation  of  Response  Time  Distribution 

The  system  response  time  is  generally  defined  as  the  time  elapsed 
between  the  receipt  by  the  system  of  a  user's  specified  Job  request  and 
he  satisfaction  of  that  request  at  the  terminal.  For  our  total  system 
model,  we  define  the  jsystem)  response  time  (T.)  to  be  the  total  length 
of  time  which  is  spent  on  the  processing  system  by  each  user  Job. 

herefore,  the  system  response  time  consists  of  both  the  waiting  time 
and  the  execution  time  of  the  Job. 

Let  p(Tr=t}  and  P(s)  be  the  probability  density  of  the  system 
response  time  and  the  corresponding  Laplace  transform.  Then, 
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!>  =  \  P  l  T  =  t  } 

1  a  r 


-st  , 
e  dt 


r»  f  N 

=  1  F  P.  p{  T  =  1 1  i  jobs  }  e‘St  dt 

J  0  ( 1=0  r  > 


(4.3.5) 


=  ^  |pi  jo  ?{  Tr=t  |  i  jobs  }  e‘St  dtj  (4.3.5) 

where  p{T.=t|i  jobs}  denotes  a  conditional  probability  density  of  the 
system  response  time  given  that  i  jobs  were  found  ahead  of  an  arriving 
job  on  the  processing  system.  Noting  that  the  m-processor  (Um«») 
processing  system  schedules  arriving  jobs  (whose  execution  times  are 
exponentially  distributed)  according  to  the  FCFS  discipline,  we  see  that 
P{  Tr=  t  |  i  jobs  }e'Stdt  x 


.  i+l-m 


. (-L  ) 

\  s+m X/  \  s+x  / 


0  s  i  sm-l 


msi  sn 


(4.3.6) 


where  the  first  component,  (^)  1+1  ^  of  the  latter  case  represents 

the  Laplace  transform  of  the  waiting  time  and  the  following  second 

.  X  . 

component,  (g+^),  represents  the  Laplace  transform  of  a  job's  execu¬ 
tion  time.  Substituting  Eq.  (4.3.6)  into  Eq.  (4.3.5),  we  get 


(4.3.7a) 


N!  nQ  X  |m-l 


)  N  ( 

/  mX 

i+l-m  ,  \ 

I  +  r  Jd 

.  '_L\1 

L  !p 
J  irrllA 

i} s+mX 

*  s+X  !) 

N-Q  s+X  lifb  +  4 


/  mX  \ 

-m  | 

's+mX  1 

I 

(4.3.7) 


can  thus  derive  the  k-th  moment,  E«Tr)k),  of  the  system  response 

time,  using  a  well-known  technique  of  differentiating  Pfs)  and  then 
setting  s  to  zero  [H2]. 


E((Tr)k)  =  (-l)kiL PISl  k  =  1,2,3,...  (4.3.8) 

ds  s=0 

In  particular,  the  average  response  time  is  obtained  as  follows. 


E(T  ) 
r 


.  iLCsl  I 

ds  I s=0 

N!nrt  /  m-1 


.  1^(7—^—). 

N  -  Q  \  i=0  i.'(N-i-l)  •  / 


N !  ir  _  /  N- 1  < 


N  !jt_  X  N-l  /  1  ,  i._  , 

N-Q  s+A  i=m  (m!  (N-i-1)  !m  I  s-Hn  X/  (s+mA)2J  s=i 


A  1 

(s+A)^  1  s=0 

1  mX  )‘+1'l 

1  *  X 

'  s-Hnxl  j  , 

1  (s+A)2  s=0 

"  "mX  if 

(s4mA)  )  I  s=0 


,  Mfm-1 


N-l 
+  Z  ■ 


N-Q  \  A  /U-0  i!(N-i-l)!  i=m(m!  (N-i-1)  •  mi_m+1 


m-1  i  N  i 

i  - 2 —  +  i  - - 

i=0  i  !  (N- i) f  i=m  m! (N- i) •  m*"rn 

- — - —  •  T 

m-1  i  N-l  i  t 

T.  - -  +  Z  - S - - - 

i=0  i!  (N-i-1)  f  i=m  m!(N-i-l)!ml'm  lsmsN 


(A. 3.9) 


This  result  agrees  with  Scherr's  result  [S4]  if  6(N)  s  l  .  Some 
special  cases  of  Eq .  (4.3.9)  may  be  worth  mentioning. 
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JJL 


E  (T.) 


N 

Z 

1=0  (N- 1) * 


N-l 

Z 


=  i 


1=0  (N- 1-1) • 


(  m=  1  ) 


(4.3.10) 


—  (  m  =  N  ) 

Obviously,  the  result  for  m=N  means  that  if  a  processor  can  always 
start  its  service  for  a  user's  job  as  soon  as  the  job  is  submitted  by 
an  interactive  user  the  average  response  time  is  given  by  the  average 


execution  time  of  the  job. 

Now  we  proceed  to  invert  P(s)  to  obtain  the  probability  density 
of  the  system  response  time  p{  Tf=  t  }  .  Let 


P^s) 


s+A. 


P2(s) 


_L 

s+A.  V  s+mA  I 


(4.3.11) 


Then,  we  must  invert  P^s)  and  P2(s)  in  order  to  invert  P(s)  given  by 
Eq.  (4.3.7).  Denoting  the  inverse  Laplace  transform  operator  byXS^, 


X_1[  Pj(s)  ] 


t  0 


(4.3.12) 


Next,  we  perform  a  partial  fraction  expansion  for  P2(s)  as  the  first 
step  to  invert  P2(s),  assuming  temporarily  that  m^l. 

,  »  .  ,i+l-m 

?2<s>'i5r(=r) 


s4mX 


(s+mA) 


+ 


+  Cj-fl-m 

,  .  ,  .i+l-m 
(s+mA) 


(4.3.13) 

m  1  1 


where  c^  (Oskfii+l-m)  are  the  unknown  constants  to  be  determined. 
These  constants  are  easily  found  to  be 
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=0  ■  ‘«*>v>L»-»(*) 

7T  £  [(S*"X) 


i+l-m 


i+l-tn 


P2(s) 


1 


J - ^  0n«1+l‘” 

j!  <s+),)1+1 


_  “(mX) 


(-D- 

i+l-m 


s=-mA 


s=-mX 


(m-l)J+1  \J 


Now,  noting  that 


L -1 

'  1 

[  (s+a)K  J 

.k-l  -at 
t  e 

(k-1)! 


1 


r 


0  ^  J  S  i-ni,  m  /  1  J 


(4.3.14) 


k=  1,  2,  3, 

P2(®)  specified  by  Eqs.  (4.3.13)  and  (4.3.14)  can  be  inverted. 

oC'1  (vs)) 


(4.3.15) 


,  i+l-m 


-xt  i;m 

e  “  ^  j+i  j 

J=0  (m-l)JTi  XJ 


,  ..i+l-m  . i-m-1  -mXt 

(mX)  t  t  J  e 

(i-m-j)! 


(4.3.16) 

m  ^  1 


Using  the  results  of  Eqs.  (4.3.12)  and  (4.3.16),  the  probability 
density  of  the  response  time  of  the  m-processor  system  (ra^l)  is 
obtained  as  follows: 
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p{Tr=t  } 

= JC 1  (pcs)] 


N!nn  /  m-1 

- -  (  E 


N-Q  l  i=0  i!  (N-i-1)  ' 


X  e 


-Xt 


»!”o  V)  £  -I  xpL-f*'" 


Xe 


-Xt  m-1 


nO 


N-Q 


N-Q  \  m!  /  0  i=( 


N-tn-1/ 

E 


0  I  (N-m-1-1) ! (m-1) 


-xc.  .-xt  3  f(-x>xt]3  3 

l+>1  J=0  J' 


m 


/I 


(4.3.17) 


For  the  single  processor  system  (m-1),  Eq.  (4.3.7)  reduces  to 


V  ikf , 


y  \  ".i-U -  p 

**  (N-i-1)! 


»  I  X  \l+1  1  1=0  (N-1- 


(4.3.18) 


jfo  75:7577 


Therefore,  the  probability  density  of  the  response  time  of  the  single 
processor  system  is  obtained,  using  Eq.  (4.3.15),  as  tollows: 
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pf  T.=  t  } 

=  X"1  [p(s)J 

—  p1  X 

(N-i-l)t  it 

?  ~~ 
jfo 

V  (H-1)!  o‘ 

1=0  O'-1-1)! 


N-l( 

I  •  • 
1  . i=0L 

N-Q 


(A. 3. 19) 


The  probability  distribution  function,  Pfl^t},  of  the  system 

response  tic,  can  be  obtained  by  Integrating  the  probability  density 

given  by  Eq.  (4,3,17)  or  Eq.  (4,3,19)  from  0  to  t.  For  mrU,  noting 
first  that 


,-mXt  j 


J!  t 


J-k 


:nX  knO  (j-k)!  (mX)k 


■L._1L  «'mU  £  J!rk 

m1  (ml)^  isl  k_0  ki(mj)j"k 


J-lf  2,...  (4.3.20) 


the  integration  of  Eq.  (4.3.17)  gives 
v 
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Then,  noting  that 
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(4.3.21) 
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(4.3.22) 


the  third  (last)  term  of  Eq.  (4.3.21)  becomes 
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,  Mll  _  N-m-1 

-jysViv,  t 
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i=0  ^(N-m-l-i) ! (m-1 ) i+1  (N-m-l-i) !mi+l 


i  -mXt 
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(4.3.23) 
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Therefore,  the  probability  distribution  function  of  the  syst 
time  for  m/l  is  obtained  as 


system  response 


Pf  TrSt  } 


l-e”^t  _  t 
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where 


(4.3.24) 
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For  m-1,  the  integration  of  Eq.  (4.3.19)  similarly  givej 
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(4.3.25) 
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Thus,  we  have  derived  the  expressions  for  the  probability  density 
and  the  probability  distribution  function  of  the  response  time  of  a 
multiprogrammed  multi-processor  time-shared  virtual -memory  computer 
system.  The  result  of  the  probability  distribution  function  of  the 
system  response  time  enables  us  to  evaluate  the  percentile  response 
time  numerically,  as  will  be  carried  out  in  Chapter  5. 

Finally,  the  last  part  of  this  section  presents  two  somewhat  periph¬ 
eral  but  relnred  results;  a  result  of  the  response  time  distribution 
conditioned  on  a  job's  execution  time  and  a  result  of  the  first-time 
response  time  distribution  (to  be  defined).  Each  of  these  results  can 
be  obtained  by  a  slight  modification  of  the  analysis  of  this  section. 

We  have  assumed  until  now  that  execution  times  of  all  user  jobs  are 
only  probabilistically  (exponentially)  known.  Now  we  assume  that  the 
execution  time  of  a  particular  arriving  job  is  completely  known,  but  the 
execution  time  of  other  user  jobs  already  on  the  processing  system  are 
only  probabilistically  known  as  above.  Then,  the  response  time  (T^)  to 
be  experienced  by  a  user  who  has  requested  a  job  requiring  an  execution 
time  of  Tg  seconds  (excluding  all  the  overhead  times  to  be  added)  is 

given  by  the  sum  of  the  waiting  time  (T  )  in  the  job  queue  Q  and  its  own 

w 

execution  time  T  .  Letting  the  Laplace  transform  of  the  waiting  time  T 
e  w 

be  Pw(s),  we  have  (see  Eqs.  (A. 3. 5), (4. 3. 6), and  (4.3.7))  therefore  the 
following  expression. 
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(4.3.26) 


where  6(t)  is  a  delta  function  at  t=0.  Using  Eq.  (4.3.15),  this  can 
be  readily  inverted  as  follows: 
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(4.3.27) 

Therefore,  the  probability  density  of  the  system  response  time 
conditioned  on  a  job's  execution  time,  p{T^= t  |Te ) ,  is  given  by 
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(4.3.28) 


Thus,  if  the  execution  time  of  a  job  is  known  in  advance,  the  results 
given  by  Eqs.  (4.3.17)  and  (4.3.19)  must  be  modified,  as  given  by 
Eq.  (4.3.28).  The  corresponding  probability  distribution  function  and 
its  moments  can  be  easily  obtained  from  the  above  result. 

We  have  been  concerned  with  the  response  time  to  be  experienced  by 

each  of  N  interactive  terminal  users  who  are  currently  using  the  time- 

shared  computer  system.  One  may  ask  a  question  about  the  system 

response  time  to  be  experienced  by  a  new  user  who  is  about  to  Join  the 

existing  N  user  population  by  typing  the  "login"  command  at  his  terminal 

This  first-time  response  time  should  be  slightly  different  from  the  one 

with  which  we  have  been  concerned  because  such  a  new  user  joins  the 

system  independently  of  the  system  state  (e.g.,  the  number  of  queuing 

users).  Let  T  ,  (i.e.,  T  =  T  .)  be  the  execution  time  (excluding  all 

the  overhead  times  to  be  added)  of  the  "login"  command.  Then,  noting 

this  new  user's  random  arrival  to  the  system,  we  see  that  the  Laplace 

transform  P  (s)  of  the  probability  density  of  the  waiting  time  T 
w 

experienced  by  this  user  is  given  by  Eq.  (4.3.26)  with  replaced  by 
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Therefore,  the  probability  density  of  the  response  time  to  be  experienced 
by  the  login  command,  p{T  (login)  = t | T  },  is  obtained  as  follows: 

p{  (login)  =  t  |  T^} 


P{Tw(login)  =  t-Teji} 


(4.3.29) 


Thus,  the  probability  density  of  the  first-time  response  time  has  been 
obtained.  The  corresponding  probability  distribution  function  and  its 
moments  can  be  readily  derived  from  the  above  result. 


4,3.4.  Relationship  with  an  Infinite  Population  Model 

In  developing  queuing  models  for  time-shared  computer  systems, 
one  must  make  a  fundamental  choice  between  the  use  of  an  infinite  or 
finite  population  model.  It  is  clearly  stated  in  [M3]  that  the  finite 
population  model  is  much  more  appropriate  as  a  model  of  time-shared 
computer  systems.  The  basic  difference  resulting  from  the  size  of  the 
population  consists  in  the  fact  that  the  arrival  rate  of  the  finite 
population  model  is  dependent  on  the  state  of  the  system  (e.g.,  the 
number  of  queuing  users)  while  the  arrival  rate  of  the  infinite 
population  model  is  not;  the  arrival  rate  of  the  finite  population  model 
increases  (decreases)  if  the  number  of  queuing  users  decreases 
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(increases),  showing  the  existence  of  a  negative  feedback  mechanism  to 
stabilize  the  number  of  queuing  users,  but  the  arrival  rate  of  the 
infinite  population  model  is  always  constant.  In  view  of  the  fact  that 
the  infinite  population  model  is  however  oftei  used  in  modeling  computer 
systems,  we  will  make  a  brief  remark  about  the  relationship  of  these 
two  kinds  of  models  in  this  section. 

We  will  particularly  discuss  the  relationship  between  the  M/M/m 
queue  with  N  users  with  which  we  have  been  concerned  and  the  M/M/m  queue 
with  an  infinite  population  [C8,F1]  whose  arrival  rate  is  Pq(3  constant). 
For  such  an  infinite  population  model,  the  steady-state  probability  P(i) 
of  finding  i  jobs  on  the  system  at  the  instant  that  a  new  job  joins  the 
waiting  queue  is  obtained  [C8,F1]  as 


—  P<°) 


P(i)  = 


s  m  • 


m!m 


l-m 


P(0) 


ISiim 


m  s  i  < 00 


where  p  =  |Jq/ 1  ,  P(0)  = 


/m-1  pS  mpm 
I.  i?0  i!  '  m*  (m-p) 


-1 


(4.3.30) 


It  can  be  easily  shown  [FI]  that  as  N- '°°  and  n  ~ *  0  in  our  finite 
population  model  in  such  a  way7  that  N,a  remains  a  constant  given 

by  Eq.  (4.3.2)  approaches  P(i)  given  by  Eq.  (4.3.30).  But  for  the  series 
E(P(i)/P(0))  to  converge  we  must  have 

jtO-  =  "util ization" <  1  (4.3.31) 

ml 

Noting  that  Q  is  finite  under  this  condition. 
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lim  P. 

.  .  N-i 
=  lim  — —  n  • 
N—  N-TJ 

=  lim 

•N  -®  1 

N  •» 

p  ~*0 

p  ~*  0 

p  0 

Np=  PQ 

Np=  PQ 

(4.3.32) 


Therefore,  both  P.  and  *.  approaches  P(i)  as  N~°°  and  ^-0  under  the 
constraint  Np  =  pg.  This  implies  that  the  system  response  time  obtained 
for  the  finite  population  model  approaches  the  system  response  time  of 
the  infinite  population  model,  under  this  limit.  For  example,  the 
probability  density  of  the  system  response  time  given  by  Eq.  (4.3.19) 

approaches  the  corresponding  solution  of  the  infinite  population  model, 
i.e., 

p{Tr=t  }  «  (X-p0)e-(X^0>t  t*0  (4.3.33) 

In  the  case  of  the  infinite  population  model,  the  system  response  time 
is  exponentially  distributed,  as  above.  In  contrast,  the  probability 
density  of  the  system  response  time  obtained  using  the  finite  population 
model  of  this  section  (see  Eqs.  (4.3.17)  and  (4.3.19)  will  be  numerically 
found  in  Chapter  5  to  be  much  closer  to  a  normal  distribution  density 
if  the  number  (N)  of  interactive  users  is  large.  This  tendency,  usually 
observed  on  actual  computer  systems  [S6],  is  a  result  of  the  finiteness 
of  the  user  population. 
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4.4.  Some  Remarks  on  Modeling.  Errors 

We  will  finish  the  response  time  analysis  of  the  mult  ip rog rammed 
time-shared  computer  system  by  examining  the  modeling  errors  which  may 
be  introduced  to  this  analysis.  Looking  back  at  what  we  have  been 
doing,  we  have  developed  a  queuing  theoretic  model  of  the  cir.p'iter 
system  whose  characteristics  were  specified  in  Section  4.2  and  have 
analyzed  the  behavior  of  such  a  model  to  investigate  the  system  response 
time  of  the  computer  system  under  study.  We  will  not  consider  what 
happens  if  a  given  (actual)  system  is  slightly  different  from  the  one 
considered  in  Section  4.2  (the  reader  who  is  interested  in  this  subject 
should  read  D'Avanzo  [D4]),  but  will  be  concerned  with  the  errors 
possibly  introduced  in  approximating  such  an  actual  computer  system 
with  our  queuing  model  called  the  total  system  model. 

We  will  particularly  discuss  three  approximations  being  used  in  the 

analysis.  The  first  one  is  the  approximation  to  use  the  hypothetical 

system,  of  Figure  4-2,  whose  execution  speed  is  traded  for  the  percentile 

2 

throughput.  As  stated  in  the  previous  chapter,  (1-9)  x 10  percent  of 
the  computational  capacity  of  the  actual  computer  system  cannot  be 
utilized  for  users'  useful  computation;  this  proportion  cf  the  system's 
capacity  is  wasted  either  by  the  system  overhead  operations  or  by 
processor  idle  time.  If  the  length  of  each  burst  of  these  wastages  is 
comparable  to  the  length  of  the  execution  time  ci  each  job,  then  the 
speed-capacity  tradeoff  will  not  give  a  good  approximation.  (Consider 
the  variance  of  the  system  response  time,  for  example.)  But  fortunately, 
on  actual  computer  systems,  these  wastages  (e.g..  paging  overhead! 
usually  have  much  shorter  lengths  than  the  job  execution  times  and 
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moreover  tend  to  occur  uniformly  in  time.  Therefore,  this  approximation 
does  not  seem  to  introduce  a  significant  error  in  the  analysis. 

The  second  approximation  consists  in  the  use  of  the  effective 
percentile  throughput  0(N) .  Basically,  the  variable  9  system  (actual 
system)  is  approximated  by  the  constant  9  system.  This  implies  that  if 
9  of  the  actual  system  (N  =  fixed)  varies  very  much  in  time  this  approx¬ 
imation  tends  to  underestimate,  for  example,  the  90  percentile  response 
time.  If  that  is  the  case,  the  use  of  0X  for  0(N),  i.e.,  the  use  of  the 
largest  lower  bound  for  0(20,  will  give  a  good  upper  bound  estimate  of 
the  90  percentile  response  time.  This  kind  of  underestimation  may  be 
significant  especially  in  the  medium  load  range  of  N  because  of  relative¬ 
ly  frequent  fluctuations  of  0'  (see  Figure  4-1);  the  estimated  90 
percentile  response  time  of  the  heavy  load  or  the  light  load  range  of 
N  tends  to  be  accurate. 

The  third  approximation  consists  in  the  fact  that  user  jobs  are 
executed  up  to  completion  under  the  FCFS  discipline  on  the  hypothetical 
system  while  the  jobs  are  not  executed  strictly  in  this  way  on  an  actual 
multiprogrammed  system  using  demand  paging  even  if  all  the  scheduling 
disciplines  (the  ones  associated  with  the  memory  queue  and  the  processor 
queue)  are  FCFS.  To  see  this,  consider  a  situation  where  a  large  job 
and  a  small  job  enter  the  processor  queue  respectively  at  time  t  and 
t+At.  It  is  very  likely  that  the  small  job  which  arrived  at  the  process¬ 
ing  system  later  than  the  large  one  will  be  completed  earlier  than  the 
large  one,  because  the  small  one  is  likely  to  demand  a  smaller  number 
of  missing-pages  than  the  other.  This  means  that  a  job  which  arrives 
at  the  system  later  than  another  can  be  completed  earlier  or.  the  actual 
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multiprogrammed  system  under  study.  Therefore,  we  must  consider  the 
effect  of  this  kind  of  favoritism  for  short  jobs  upon  the  system 
response  time.  The  average  response  time  is  not  affected  because  the 
execution  time  is  assumed  to  follow  the  (memoryless)  exponential  distri¬ 
bution,  but  Its  variance  or  90  percentile  response  time  is.  The 
variance  of  the  system  response  time  is  basically  proportional  to  the 
sum  of  the  variances  of  job  execution  times  because  execution  times  are 
assumed  to  be  independent  of  each  other.  Considering  that  if  the  system 
is  under  a  heavy  load  the  number  of  multiprogrammed  jobs  is  constant  (q*> 
but  the  number  of  jobs  waiting  in  the  memory  queue  varies  in  time,  we 
see  that  the  variance  of  the  response  time  of  the  system  under  a  heavy 
load  is  determined  mainly  by  the  varying  number  of  the  jobs  in  theFCFS 
memory  queue.  Therefore,  this  argument  suggests  that  the  90  percentile 
system  response  time  tends  to  be  accurate  as  the  number  of  jobs  queuing 
on  the  processing  system  increases,  on  the  contrary,  if  there  are  only 
moderate  number  of  jobs  on  the  processing  system  ,  the  variance  of  the 
response  time  would  be  affected  by  the  favoritism  for  short  jobs:  the 
variance  (or  90  percentile  response  time)  predicted  by  the  total  system 
model  is  somewhat  smaller  (shorter)  than  that  of  the  corresponding 
actual  system  because  the  genuine  FCFS  scheduling  discipline,  used  in 
the  model,  is  known  to  attain  the  smallest  variance  of  response  times 

[  C  7  ] 


CHAPTER  5 


MODEL  VALIDATION,  PERFORMANCE  PREDICTION  AND 
OPTIMIZATION,  AND  CONFIGURATION  SELECTION 


5.1.  Introduction 

We  have  finished  the  development  and  analyses  of  all  the  hierarchi¬ 
cally  organized  modular  models  shown  in  Figure  1-4  that  we  intended  to 
Describe  in  this  thesis.  It  is  time  to  examine  the  validity  of  these 
models  and  to  consider  if  the  performance  questions  raised  in  Table  1.1 

of  Chapter  1  can  be  answered  by  a  series  of  these  analyses  given  in 
Chapters  2,  3,  and  4. 

We  will  first  examine  the  validity  of  the  processor  model  and  the 
total  system  model  by  comparing  the  system  performance  predicted  by 
these  -two  models  and  the  available  statistics  of  an  actual  computer 
system,  the  Multics  system  of  M.I.T.  This  validity  examination  of  the 
models  is  intended  to  present  a  rough  idea  about  the  accuracy  of  the 
system  performance  that  can  be  predicted  by  these  models.  Then,  noting 
that  the  performance  questions  of  Table  1.1  can  be  classified  into 


performance  prediction  problems  (the  second  through  the  fifth  questions), 
performance  optimization  problems  (the  sixth  through  the  eighth 
questions),  and  a  configuration  selection  problem  (the  first  question), 
we  shall  proceed  to  consider  each  class  of  problems  quantitatively  one 
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5.2.  Model  Validation 

In  this  section,  the  validity  of  the  models  developed  in  the 
preceeding  chapters  will  be  examined  by  comparing  the  performance  of  an 
actual  large-scale  time-shared  computer  system,  the  Multics  system  of 
M.I.T,  [C10,C12],  with  the  performance  result  that  can  be  obtained 
using  these  models.  The  instrumentation  used  in  the  development  of 
the  Multics  system  and  the  performance  statistics  obtained  from  this 
running  system  w'll  br  first  described  and  then  the  details  of  the 
statistical  results  will  be  presented. 


5.2.1.  Instrumentation  and  Multics  Performance 

The  Multics  system  is  a  large-scale  time-shared  computer  system 
developed  as  a  cooperative  effort  involving  the  Bell  Telephone 
Laboratories  (from  1965  to  1969),  the  computer  department  of  the 
General  Electric  Company  (subsequently  acquired  by  Honeywell  Information 
Systems  Inc.),  and  Project  MAC  of  M.I.T.  The  system  has  all  the 
features  of  modern  large-scale  time-shared  computer  systems  such  as 
paging,  segmentation,  multiprogramming,  multi-processing,  memory 
hierarchy,  and  so  on. 

The  current  standard  configuration  of  the  Multics  processing 
system  at  M.I.T.  includes  two  processors,  three  primary  memory  units 
(128  k  thirty-six  bit  words  per  unit),  and  the  secondary  memory  system 
consisting  of  drum  and  disk  memories,  with  five  or  six  eligible  user 
processes  under  multiprogramming  (see  Figure  5-1).  Occasionally,  a 
processor,  a  primary  memory  unit,  and/or  some  part  of  secondary  memory 


Figure  5-1  A  Typical  Multics  Processing  System  Configuration 
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is  removed  from  the  servicing  system,  for  maintenance  or  to  create 
another  system  for  debugging  a  new  version  of  the  operating  system  [C12]. 
Therefore,  the  Multics  system  has  been  running  wich  one  of  the  following 
hardware  configurations  during  the  past  year. 

(1)  large-scale  configuration  ...  standard  configuration^ 
(two-processor  three-primary-memory-unit  system) 

(2)  medium-scale  configuration 

(one-processor  three-primary-memory-unit  system) 

(3)  small-scale  configuration 

(one-processor  two -primary-memory-unit  system) 

In  this  way,  every  effort  has  been  made  to  support  the  continued  opera¬ 
tion  of  the  system. 

Early  in  the  design  stage  of  this  system,  it  was  felt  that  the 
cost  of  maintaining  well-organised  instrumentation  can  be  made  low  and 
the  payoff  in  being  able  to  "look  at  the  meters"  any  time  a  performance 
problem  is  suspected  is  very  high.  This  initial  conviction  resulted  in 
a  comprehensive  set  of  system  metering  commands  that  can  be  used  from 
any  terminal  [S2]  and  the  use  of  a  DEC  PDP-8  system  as  a  peripheral 
computer  to  test  and  monitor  the  operation  of  the  Multics  system  [G3,G4]. 
The  metering  commands  have  proved  to  be  extremely  useful  in  measuring 
the  paging  performance  of  programs,  the  secondary  memory  performance, 
the  processor  time  usage  (among  various  system  overhead  times,  idle 


Whenever  there  was  not  a  need  to  reduce  the  configuration,  this 
standard  configuration  of  the  Multics  system  has  been  used. 
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times,  and  users'  useful  computation  time),  and  user  behavior  character¬ 
istics.  The  use  of  the  PDP-8  computer  as  a  simulator  of  interactive 
users  [G3]  has  provided  system  developers  with  a  convenient  tool  to 
test  the  performance  of  a  newly  installed  version  of  the  operating 
system;  a  standard  benchmark  (or  a  series  of  commands  interspersed  by 
think  times  representing  a  typical  debugging  user  process)  has  been  run 
from  time  to  time  during  the  past  four  years  to  measure  the  execution 
time,  the  paging  performance,  and  the  response  time  of  each  command 
included  in  the  benchmark.  Thus,  these  tools  have  enabled  system 
analysts  to  obtain  the  operational  statistics  of  the  Multics  performance 
which  are  essential  in  improving  the  succeeding  versions  of  the  operating 
system. 

Many  performance  statistics  concerning  the  Multics  system  have 
been  accumulated  using  these  measurement  tools.  Three  sample  results 
each  representing  the  performance  of  a  different  hardware  configuration 
were  randomly  chosen  from  these  performance  statistics  to  show  the 
typical  performance  of  each  of  these  three  hardware  configurations. 

These  sample  results  are  given  in  Table  5-1.  bach  sample  result  was 
collected  from  the  system  during  a  thirty-minute  session  either  in  the 
morning  or  afternoon  of  a  normal  working  day  when  the  system  was 
operating  normally  and  was  fully  loaded  with  interactive  terminal  users. 
Although  the  numbers  given  in  Table  5-1  are  subject  to  statistical 
fluctuations,  it  is  felt  that  they  are  representative  f  typical 
performance  of  the  Multics  system  and  that  they  can  be  used  for  the 
purpose  of  examining  the  validity  of  the  models  developed  in  this  thesis. 
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Table  5-1  Typical 

Performance 

of  the  Multics 

System 

under  a 

Full  Load 

Configuration 

Small 

Med ium 

Large 

1  CPU 

1  CPU 

2  CPUs 

2  PM  units 

3  PM  units 

3  PM  units 

average  number  of 

eligible  processes 

5.7 

5.3 

5.6 

mtbpf  (msec) 

16.5 

29.7 

31.9 

mpft  (msec) 

32.4 

33.2 

30.9 

mean  paging  overhead 

5.4 

A  1 

time  (msec) 

9.0 

•processor  time  breakdown^ 

— 

-  —  _  . 

multiprogrananing  idle 

2.9  1 

1.8  7. 

8.8  7. 

memory  interference  idle 

j  0 

0 

5.7 

paging  overhead 

28.7 

18.4 

22.9 

miscellaneous  overhead 

16.6 

15.9 

12.6 

users'  useful  computation 

51.8 

63.9 

49.9 

number  of  users 

41 

43 

48 

♦  response  time  characteiTt  icsi 

— 

- - 

—  — * - - 

average  queue  length 

16.7 

14.9 

14.9 

average  response  time  to 

the  PDP8  user  simulator 
(sec) 


not  measured 


7.9 


6.0 


July  1971 
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Using  a  sec  of  metering  csmands ,  the  mean  paging  overhead  times 
(all  the  supervisory  operations  necessitated  by  «  page  fault  are  includ¬ 
ed)  of  a  single  processor  system  and  a  dual  processor  system  are 
respectively  found  to  be  approximately  6  and  9  milliseconds  long}  On 
the  other  hand,  the  value  of  the  miscellaneous  overhead  coefficient  (6) 
is  found  to  be  between  0.2  and  0.35,  with  a  typical  value  of  0.25.  The 
average  think  time  of  interactive  users  is  found  to  vary  from  session 
to  session  (e.g.,  12  to  35  seconds),  but  it  is  typically  15  to  25 
seconds  long.  These  numbers  represent  shorter  think  times  than  observed 
for  the  CTSS  system  by  Scherr  (S4).  The  average  execution  time  required 
by  an  interactive  Job  (excluding  all  the  overhead  times  to  be  added)  i6 
roughly  400  milliseconds  long. 

In  a  special  measurement  of  multi-processor  interference,  it  was 
found  that  if  two  processors  direct  their  accesses  to  a  particular 
primary  memory  unit  at  all  times  the  execution  time  of  a  typical 
program  is  stretched  by  about  twenty  percent.  This  means  that  the 
value  of  the  memory  interference  coefficient  of  the  Multlcs  system  is 
approximately  1.2,  i.e.,  Y=1.2.  On  the  other  hand,  it  has  been 
generally  observed  on  the  large-scale  configuration  (with  three  primary 
memory  units)  under  a  heavy  load  that  the  system  typically  loses  5  to  6 
percent  of  each  processor's  processing  tine  because  of  memory  cycle 


^Most  of  the  increase  of  the  paging  overhead  time  on  a  dual  processor 
system  is  due  to  the  data-base  lockout.  As  described  in  Table  3-1  of 
Chapter  3,  the  data-base  lockout  is  frequent  enough  to  prolong  signifi 
cantly  program  execution  time  only  in  the  page  fault  processing. 
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interference  and  another  5  to  6  percent  of  each  processor's  processing 
time  because  of  data  base  lockout.  Most  of  the  loss  due  to  data-tase 
lockout  is  being  caused  by  the  lockout  of  the  page  table. 


5.2,2.  Validation  of  Processor  Mode!. 

In  this  section,  we  examine  the  validity  of  the  processor  model  of 
Chapter  3  by  comparing  the  performance  of  the  above  three  configurations 
of  the  Multics  system  to  be  predicted  by  the  processor  model  and  the 
actual  performance  of  the  same  system  summarized  in  Table  5-i. 

For  this  purpose,  a  program  was  written  in  PL/I  to  derive  the 
performance  of  the  processing  system  under  investigation  using  the 
processor  model.  We  call  this  progran  the  throughput  analysis  program 
and  it  is  included  in  Appendix  A  together  with  an  explanation  about 
how  to  use  it  and  a  sample  console  session  using  thf.  program.  This 
program  derives  the  processor  time  breakdown  into  various  system  over¬ 
head  times,  idle  times,  and  users'  useful  computation  time  (i.e., 
percentile  throughput),  upon  specification  of  the  configuration  of  the 
processing  system  under  study. 

In  predicting  the  performance  of  each  of  the  above  three  hardware 
configuration  of  the  Multics  system,  all  the  input  parameters  (except 
the  degree  of  multiprogramming)  were  set  fairly  close  to  the  correspond¬ 
ing  measured  values  given  in  Table  5-1.  The  degree  of  multiprogramming 


This  observation  will  be  found  consistent  with  the  above  measurement 
result,  Y=1.2,  numerically  in  Section  5.2.2. 
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was  then  changed  to  see  the  effect  on  the  resulting  processor  time 
breakdown.  The  result  is  shown  in  Table  5-2  (a),  (b)  and  (c).  It  is 
clearly  seen  that  as  the  degree  of  multiprogramming  approaches  the 
measured  value  of  the  average  number  of  eligible  processes  of  Table  5-1 
the  resulting  processor  time  breakdown  becomes  very  similar  to  the 
measured  processor  time  breakdown  of  Table  5-1.  The  closest  result 
analytically  obtained  for  each  configuration  is  enclosed  by  a  broken 
line  in  Table  5-2.  For  example,  it  is  seen  that  the  processor  time 
breakdown  of  the  small-scale  configuration  predicted  by  the  processor 
model  becomes  very  close  to  the  measured  performance  of  this  configura¬ 
tion  given  in  Table  5-1  when  the  degree  of  multiprogramming  is  fixed 
at  either  5  or  6 .  The  performance  of  the  two-processor  configuration 
(i.e.,  the  large-scale  configuration)  is  also  predicted  fairly  accurate¬ 
ly  by  the  processor  model,  as  seen  in  Table  5-2  (c). 

As  a  result  of  comparing  the  predicted  performance  of  Table  5-2 
and  the  actually  measured  performance  of  the  Multics  system,  we  conclude 
that  the  processor  model  can  be  used  as  a  practical  tool  of  performance 
prediction  despite  its  simplified  abstraction  of  complex  structure  of 
actual  computer  systems  (see  Section  1.6). 


5.2.3.  Validation  of  Total  System  Model 

In  this  section,  we  examine  the  validity  of  the  total  system 
model  developed  in  Chapter  4  by  comparing  the  performance  of  the  three 
configurations  of  this  Multics  system  to  be  predicted  by  the  total 
system  model  and  the  actual  performance  of  the  corresponding  configura 
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Table  5-2  Validation  of  the  Processor  Model 


(a)  one-processor  two-primary-memory-unit  configuration,  i.e., 
small-scale  configuration 


degree  of  multiprogramming 

2 

3 

4 

5 

6 

multiprogramming  idle 

39.4 

20.5 

9.2 

3.5 

1.1 

memory  interference  idle 

0 

0 

0 

0 

11  \ 

paging  overhead 

19.8 

26.0 

29.7 

31.6 

32.4 

miscellaneous  overhead 

9.9 

13.0 

14.8 

15.7 

16.1 

users'  useful  computation 

30.9 

40.5 

46.3 

49.2 

50.4  ! 

mtbpf = 16.5  msec,  mpft  =  32 .4  msec,  t  ,=5.4  msec,  6=0.32 

pi 


(b)  one-processor  three-primary-memory-unit  configuration,  i.e., 
medium-scale  configuration 


degree  of  multiprogramming 

m§ 

2 

3 

n 

5  ; 

multiprogramming  idle 

53.8 

23.9 

8.5 

2.4 

0.6  1 

memory  interference  idle 

0 

0 

0 

0 

o  ; 

paging  overhead 

9.4 

15.5 

18.6 

19.8 

20.2  i 

miscellaneous  overhead 

7.4 

12.1 

14.6 

15.5 

15.8  ! 

users'  useful  computation 

29.4 

48.5 

58.3 

62.2 

63.4  , 

i 

mtbpf  =  30  msec,  mpft  =  35  msec 

5  tpl  =  6 

.  1  msec. 

5  =0.25 

(c)  two-processor  three-primary- 

■memory- 

unit  configuration,  i.e.. 

large-scale  configuration 

degree  of  multiprogramming 

2 

3 

4 

5 

multiprogramming  idle 

74.6 

48.4 

27.2 

13.0 

5.2 

memory  interference  idle 

0 

1.7 

3.5 

4.9 

5.7 

paging  overhead 

7.2 

14.1 

19.6 

23.2 

25.1 

miscellaneous  overhead 

3.6 

7.2 

10.0 

11.8 

12.8 

users'  useful  computation 

14.6 

28.7 

39.8 

47.2 

51.2 

mtbpf  =  31. 9  msec,  mpft  =  30.9  msec,  K.t  =9  msec,  6  =  0.25,  Y=1.2, 
p  =  0.1*  1  P1 


It  has  been  numerically  found  that  the  resulting  performance  is  rather 
insensitive  to  a  particular  set  of  values  of  "t.  and  p  if  "t.  /p=mtbpf 
is  constan'  (see  Section  3.4.1).  Therefore,  tfte  value  of  p^as  arbitrar¬ 
ily  chosen  to  be  0.1.  (An  explanation  of  these  input  variables  can  ce 
found,  for  example,  in  Appendix  A.) 
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tions  summarized  in  Table  5-1. 

For  this  purpose,  a  program  was  written  in  PL/I  to  evaluate  the 
response  time  characteristics  of  the  entire  computer  system  under 
investigation  using  the  total  system  model.  We  call  this  program  the 
response  time  analysis  program  and  it  is  included  in  Appendix  B  together 
with  an  explanation  about  how  to  use  it  and  a  sample  console  session 
using  the  program.  The  program  has  two  phases:  the  first  phase  which 
determines  the  effective  percentile  throughput  and  the  second  phase 
which  derives  the  resulting  response  time  distribution. 

Unfortunately,  all  of  the  data  required  to  validate  the  entire 
total  system  model  is  not  available  on  the  Multics  system.  For  instance, 
a  response  time  distribution  is  not  measured  on  the  Multics  system;  one 
of  the  metering  commands  is  capable  of  measuring  only  the  average  queue 
length  as  an  indicator  of  the  average  system  response  time  experienced 
by  a  population  of  interactive  users.  On  the  other  hand,  the  PDP-8 
user  simu'-lator  can  measure  the  response  time  distribution  for  a 
particular  benchmark.  Therefore,  the  average  response  time  measured  by 
the  user  simulator  represents  the  average  response  time  experienced  by 
a  population  of  all  users  only  roughly.  As  for  the  input  parameters 
needed  by  the  response  time  analysis  program,  the  values  of  0^(percen- 
tile  throughput  of  the  system  under  multiprogramming  of  degree  q(l£q£q*)) 
are  not  all  measured;  only  the  value  of  o(N)  can  be  measured  by  a  meter¬ 
ing  command  (note  that  6(N) ^ 9^*  under  a  heavy  user  load).  Therefore, 
we  can  at  best  examine  the  validity  of  the  latter  phase  of  the  total 
system  model  partially. 

In  predicting  the  response  time  characteristics  of  each  of  those 
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three  hardware  configurations  of  the  Multics  system,  the  actual  values 
of  the  percentile  throughput  and  the  number  of  users  were  supplied  as 
input  parameters  to  the  response  time  analysis  program.  The  average 
execution  time  needed  for  useful  computation  of  a  user's  job  was  chosen 
to  be  400  milliseconds  long.  The  average  think  time  of  users  was 
changed  to  see  the  effect  upon  the  resulting  average  queue  length  (of 
user  jobs)  and  average  response  time,  for  each  of  the  above  hardware 
configurations.  The  result  is  shown  in  Table  5-3  (a),  (b),  and  (c). 

The  predicted  performance  which  is  closest  to  the  actual  performance 
given  in  Table  5-1  is  enclosed  by  a  broken  line  for  each  configuration. 
It  is  observed  that  a  close  match  between  the  predicted  performance  and 
the  actual  performance  is  obtained  when  the  average  think  time  of  users 
is  chosen  to  be  14  to  20  seconds  long.  This  result  of  user  think  time 
is  consistent  with  a  general  observation  that  the  typical  think  time  of 
the  Multics  system  is  approximately  15  to  25  seconds  long.  However, 
unavailability  of  a  metering  command  for  the  measurement  of  users'  think 
time  for  each  experiment  prevents  us  from  examining  more  details  of  the 
validity  of  the  total  system  model. 
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Table  5-3  Validation  of  the  Total  System  Model 


(a)  one-procec sor  two-primary-memory-unit  configuration,  i.e., 
small-scale  configuration. 


average 

think  time 
_ (see) 

16 

r- - - - — 

!  18 

1 

20  ; 

1 

22 

average 

queue  length 

20.3 

1 

;  17.7 

» 

15.1  ' 

12.7 

average 

response  time 
(sec) 

15.7 

1  13.7 

11.7  ! 

9.8 

percentile  throughput = 51 . 8  %  ,  average  execution  time  of  a  job =0.4  sec. 


number  of  users  =  41. 


(b)  one-processor  three-primary-memory-unit  configuration,  i.e., 
medium-scale  configuration. 


average  think  time 
(sec) 

16 

i  to 

20 

-  \ 

1 

t 

J„ 

22 

average  queue  length 

i 

■  14.3 

| 

11*4 

—f— 

J 

8.9 

average  response  time 
(sec) 

L.  - 

7.2 

1 

|> 

5.7 

percentile  throughput = 63.9  %  ,  average  execution  time  of  a  job  =0.4  sec, 
number  of  users  =  43. 


(c)  two-processor  three-pr imary-memory-unit  configuration,  i.e., 
large-scale  configuration. 


average 

think  time 
(sec) 

1 

i  14  i 

i  i 

:  1 

16 

18 

20 

average 

queue  length 

ISH 

9.4 

6.6 

4.8 

average 

response  time 

;  5.4  i 

3.9 

2.? 

2.2 

(sec) 

i _ i 

percentile  throughput = 49 .9  %  ,  average  execution  time  of  a  job  =0.4  sec, 
number  of  users  =  48. 


■N 
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5.3.  Performance  Prediction 

It  is  now  fairly  reasonable  to  expect  that  the  models  developed  in 
this  thesis  can  serve  as  a  practical  tool  to  predict  the  performance  of 
a  system  in  question  whose  configuration  is  specified.  Therefore, 
we  shall  proceed  to  use  these  models  in  evaluating  the  effect  of  several 
important  system  parameters  upon  the  processor  time  breakdown  and  the 
system  response  time. 


5.3.1.  Effect  of  System  Parameters  upon  Throughput 

In  this  section,  we  consider  the  probelm  of  improving  the  percen¬ 
tile  throughput  of  the  large-scale  (standard)  configuration  of  the 
Mu 1 tics  system,  i.e.,  the  two-processor  three-primary-memory-unit 
configuration,  as  an  example  of  investigating  the  effects  of  several 
important  system  parameters  upon  its  percentile  throughput.  We  assume, 
as  our  starting  point,  that  the  percentile  throughput  of  this  configura¬ 
tion  (under  multiprogramming  of  degree  4)  is  47.2  percent,  as  shown  in 
Table  5-2  (c) .  For  the  sake  of  simplicity,  we  do  not  change  the  degree 
of  multiprogramming  in  this  section. 

There  are  several  possible  approaches  to  improve  the  percentile 
throughput  of  this  configuration.  For  example, 

(a)  Addition  of  one  more  128  k word  primary  memory  unit 

(b)  Enhancement  of  secondary  memory  speed 

(c)  Reduction  of  system  overhead  time 


Approach  (a)  aims  to  increase  the  mtbpf  of  user  processes  under  multi- 
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programming  by  increasing  the  amount  of  primary  memory  available  to 
each  eligible  user  process.  The  resident  Multics  supervisor  programs 
(including  the  I/O  buffer,  the  memory  space  required  by  the  page  table, 
etc.)  occupy  approximately  90  k words  of  primary  memory  space.  Then, 
using  the  linear  paging  model  of  Section  2.5.2.,  we  can  roughly  expect 
the  mtbpf  to  become  about  45  milliseconds  long  if  128  k words  are  added 
to  the  current  384  kword  primary  memory.  The  longer  mtbpf  naturally 
decreases  the  percentage  of  both  multiprogramming  idle  time  and  paging 
overhead  time,  and  therefore  increases  the  percentile  throughput. 

Approach  (b)  also  aims  to  reduce  the  mu1 tiprogramming  idle  time,  by 
having  a  shorter  mpft.  It  is  similarly  possible  to  improve  the  percen¬ 
tile  throughput  of  the  system.  The  shorter  mpft  is  usually  attained  by 
replacing  the  existing  secondary  memory  device  by  a  faster  device. 
Another  way  which  is  applicable  to  a  rotating  device  like  a  drum  is  to 
create  multiple  copies  of  each  file  on  the  device;  the  first  copy  access 
ed  by  the  device  head  is  read  and  transferred  to  primary  memory,  saving 
some  part  of  the  device's  access  time.1  We  assume  that  the  current  mpft 
can  be  somehow  halved. 

On  the  other  hand,  approach  (c)  aims  to  increase  the  percentile 
throughput  not  by  decreasing  the  multiprogramming  idle  time  as  in  the 
above  two  approaches  but  by  reducing  the  system  overhead  time;  the 
multiprogramming  idle  time  will  not  be  affected  in  this  approach.  We 


1By  creating  multiple  copies  on  the  device  we  decrease  the  memory  space 
of  the  device.  This  time-space  tradeoff  approach  is  being  tested  on  the 
Multics  drum  for  the  purpose  of  reducing  the  mpft  of  the  secondary  memory  system. 
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assume  that  the  slow-down  factor  (K.)  due  to  data-base  lockout  can  be 

At 

reduced  frcn  (the  current  value  of)  1.5  to  1.2  by  shortening  each  lock¬ 
ing  period  of  shared  writable  data-bases;  this  means  that  the  mean 
paging  overhead  time  (K^t^)  would  be  approximately  7  1.2x6)  milli¬ 

seconds  long.  The  shorter  mean  paging  overhead  time  may  also  be  attain¬ 
ed  by  reprogramming  the  page  fault  handler.  Furthermore,  we  assume  that 
the  miscellaneous  overhead  coefficient  (6)  can  be  reduced  from  the 
current  value  (0.25)  to  0.2  by  reprogramming  the  miscellaneous  fault 
handlers. 

The  system  performance  resulting  f;om  each  of  these  three  approaches 
was  then  evaluated  using  the  throughput  analysis  program.  The  result  is 
summarized  in  Table  5-4.  The  expected  change  in  processor  time  usage  is 
clearly  seen  in  each  approach.  It  should  be  noted  that  approach  (a)  attains 
a  nearly  10  percent  improvement  in  percentile  throughput  (really  a  relative 
improvement  of  about  20  percent),  while  other  approaches  attain  a  5  to 
6  percent  improvement.  Howe’  er,  approach  (a)  may  involve  the  highest 
cost;  the  decision  of  choosing  the  right  approach  must  be  made  in 
consideration  of  the  cost-performance.  This  aspect  of  configuration 
selection  will  be  discussed  later  in  Section  5.5. 

Finally,  it  should  be  mentioned  that  there  are  other  approaches  to 
improve  the  system  throughput.  One  could  divide  the  primary  memory  into 
smaller  memory  units  in  order  to  decrease  the  percentage  of  memory 
interference  idle  time;  the  effect  of  having  more  memory  units  is  seen 
even  in  approach  (a)  where  the  number  of  memory  units  is  increased  only 
by  one.  Another  approach  may  be  to  add  another  processor  to  the  exist¬ 
ing  processing  system.  The  more  attractive  approach  which  does  not 
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Table  5-4  Alternative  Approaches  to  Improve  System  Throughput 


Configuration 

Current 

System 

Approach 

(a) 

Approach 

(b) 

Approach 

(c) 

No.  of  Processors 

2 

2 

2 

2 

No.  of  PM  units 

3 

4 

3 

3 

Degree  of  Multiprogramming 

4 

4 

4 

4 

mtbpf  (msec) 

31.9 

45.0 

31.9 

31.9 

mpft  (msec) 

30.9 

30.9 

15.5 

30.9 

Mean  Paging  Overhead  Time 

9 

9 

9 

7 

(msec) 

Misc.  Overhead  Coeff. 

0.25 

0.25 

0.25 

0.20 

Memory  Interference  Coeff. 

1.2 

1.2 

1.2 

1.2 

*  processor  time  usage  * 

Multiprogramming  Idle 

13.0  % 

7.0  % 

3.2  % 

13.0  % 

Memory  Interference  Idle 

4.9 

4.2 

5.9 

4.9 

Paging  Overhead 

23.2 

17.8 

25.7 

18.0 

Miscellaneous  Overhead 

11.8 

14.2 

13.1 

10.7 

Users1  Useful  Computation 

47.2 

56.9 

52.2 

53.4 
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involve  any  increase  of  system  cost  in  to  optimize  the  degree  of  multi¬ 
programming.  This  will  be  studied  in  detail  in  Section  5.4.1. 


5.3.2._  Effect  of  Percentile  Throughput  upon  Response  Time 

In  this  section,  we  examine  the  quantitative  effect  of  the  percen¬ 
tile  throughput  upon  the  response  time  characteristics  of  a  system  so 
that  we  can  determine  the  number  of  interactive  users  that  can  be 
supported  by  the  system  with  a  given  configuration. 

First,  in  order  to  present  a  rough  idea  about  the  response  time 
characteristics,  we  evaluate  only  the  average  response  times  of  the 
three  configurations  of  the  Multics  system,  for  the  range  of  the  number 
of  users  that  impose  a  heavy  load  upon  the  system.  For  this  purpose,  we 
need  to  specify  only  the  value  of  the  optimized  percentile  throughput 
of  each  configuration.  We  assume  therefore  that  the  small-scale, 
the  medium-scale,  and  the  large-scale  configurations  of  the  Multics 
system  have  respectively  the  optimized  percentile  throughputs  of  50,  65, 
and  50  percent.  Then,  noting  that  0(N)  -  6q*  in  the  above  range  of  the 
number  of  users  (see  Section  4.3.2.),  the  average  response  time  can  be 
obtained  by  Eq.  (4.3.9).  As  shown  in  Figure  5-7.  the  result  shows  that 
the  average  response  time  (Tf)  increases  almost  linearly  with  the 
number  (N)  of  users.  In  fact,  the  asymptote  for  the  average  response 
time  can  be  directly  obtained  from  the  result  derived  by  Scherr  [S4J  for 
his  simpler  model,  as  follows: 


averag 

e 

j 

riispOTr 

Lime 

GE 

■antflt-ecflln  / 

L'-crn  f  L  gu  lot  Lon/ 

g  ”0.50  / 

q*  j 

/ medium- sc 
/  configura 

/  C  .  *  0  AS 

■ac.  j  / 

' 

T  =*  20 

let.  /  / 

number  of  users 


Figure  5-2  Effect  of  Percentile  Throughput  upon  Average  Response  Time 


The  broken  line  part  of  each  curve  represents  a  lower-bound  estimate  of 
the  average  response  time.  Without  specifying  the  values  of  0q  for  all 
q  (isqsq*),  the  average  reponse  time  cannot  be  accurately  determined 

for  the  medium  or  light  load  range  of  N. 
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where  m,  Tg,  and  are  respectively  the  number  of  processors,  the 
average  execution  time  of  each  user's  job  (excluding  all  the  system 
overhead  times),  and  the  average  think  time  of  each  terminal  user.  It 
must  be  noted  that  the  slope  of  the  asymptote  is  inversely  proportional 
to  the  optimized  percentile  throughput.  This  means  that  maximization 
of  optimized  percentile  throughput  leads  to  minimization  of  the  average 
response  time. 

Next,  we  proceed  to  evaluate  the  distribution  of  response  time  for 
the  entire  range  of  the  number  of  users.  For  this  purpose,  we  need  to 
specify  all  the  values  of  0q  (Uqfiq*)  of  the  system  under  study.  As 
an  example,  we  consider  the  medium-scale  configuration  of  the  Multics 
system,  assuming  that 

q*  =  4 

(  Gr  G2’  63’  ®4  }  =  <°-55>  0.63,  0-65,  0.65) 

T  =  0.4  seconds.  T  =  20  seconds. 

tr  t 

Then,  using  the  first  phase  of  the  response  time  analysis  program,  the 
effective  percentile  throughput  0(N)  was  calculated,  as  shown  in  Table 
seen  tf at  ©(N)  ^  under  a  light  load,  but  as  interactive 
users  begin  to  impose  a  heavier  load  upon  the  system  0(N)  gradually 
approaches  0  .  in  Figure  5-3,  the  average,  10,  and  90  percentile 

response  times  obtainei  using  the  second  phase  of  the  response  analysis 
program  are  shown  as  a  function  of  the  number  of  interactive  users.  It 
is  observed  in  this  example  that  the  90  percentile  response  time  is 
about  twice  as  long  as  the  corresponding  average  response  time.  If  90 
percentile  response  time  most  be  kept  under  ten  seconds,  the  medium- 
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scale  Multics  configuration  can  support  at  most  39  interactive  users*. 

Finally,  the  probability  density  and  the  probability  distribution 
function  obtained  by  the  response  time  analysis  program  are  shown  re¬ 
spectively  in  Figures  5-4  and  5-5.  Figure  5-4  clearly  shows  that  the 
probability  density  gradually  flattens  out  as  the  number  of  interactive 
users  increases;  if  the  number  of  users  is  very  small  (e.g.,  ten  users) 
the  response  time  distribution  is  similar  to  the  distribution  of  a  job's 
execution  time  (i.e.,  an  exponential  distribution)  in  shape,  but  if  the 
number  of  users  is  large  enough  to  impose  a  heavy  load  upon  the  system 
(e.g.,  fifty  users)  the  response  time  distribution  is  more  like  a  normal 
distribution.  This  tendency  of  the  response  time  distribution  of  the 
Multics  system  has  been  actually  measured  by  the  PDP-8  user  simulator 
also.  Figure  5-5  gives  the  percentile  response  times  for  each  value  of 
the  number  (N)  of  users.  For  example,  it  is  seen  that  if  the  system 
has  40  interactive  users  80  percent  of  the  response  times  fall  between 
1.4  seconds  and  10.2  seconds  (or  below  8.5  seconds)  with  5.1  seconds  as 
a  median.  These  results  predicted  by  the  response  time  analysis  are 
generally  consistent  with  the  casually  observed  performance  of  the 
Multics  system. 


The  actual  system  of  thi  configuration  currently  (1971-72)  supports  up 
to  45  users.  But  these  numbers  may  change  with  time  as  the  system 
characteristics  change. 


Figure  5-5  Probability  Distribution  Functions  of  Response  Times 
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5.3.3.  Effect  of  User  Characteristics  upon  Response  Time 

An  interaction  cycle  of  a  terminal  user  consists  of  the  user's 
think  time  (T  ),  a  waiting  time  experienced  by  the  user's  job  in  the  job 
queue  of  the  computer  system  (W^),  and  an  execution  time  re¬ 
quired  by  the  user's  job  (T,).  Therefore,  the  user's  need  for  processor 
time  per  unit  real  time  is  given  by  T/^  +  T.).  The  system  response 
time  T  is  a  complicated  function  of  various  system  parameters  (in¬ 
cluding  T  and  Te)  as  seen  in  Chapter  4,  but  Tfc  and  T£  are  (purely)  user 
characteristics  and  are  both  prime  factors  determining  a  user's  need 

for  processor  time  per  unit  real  time. 

We  will  therefore  examine  the  effect  of  these  two  parameters 
concerning  user  characteristics  upon  the  average  response  time,  for  the 
range  of  the  number  of  users  that  impose  a  heavy  load  upon  the  system. 

In  particular,  we  will  consider  the  medium-scale  and  large-scale  configu¬ 
rations  of  the  Multics  system,  assuming  that  these  configurations  have 
65  and  50  percent  as  their  optimized  percentile  throughput  respectively. 

The  result  obtained  by  the  response  time  analysis  program  is  graph¬ 
ically  summarized  in  Figures  5-6  and  5-7.  From  these  figures,  it  becomes 
clear  that  a  twenty- five  percent  change  in  the  value  of  each  of  these 
user  parameters  produces  more  than  several  users  as  a  change  in  the 
number  of  interactive  users  that  can  be  supported  by  the  system  (use 
five- second  average  response  time  as  a  criterion,  for  example).  Further¬ 
more,  it  should  be  noted  that  in  this  heavy  load  range  of  N  the  response 
time  is  very  sensitive  to  these  two  parameters  (Tfc,  Tg)  and  the  number 
of  users.  This  indicates  the  importance  of  a  dynamic  load  controller 
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Figure  5-7  Effect  of  Execution  Time  of  User's  Job  upon 
Average  Response  Time 


l 


222 


5.4.  Performance  Optimization 

Now  we  shall  proceed  to  consider  the  performance  optimization 
problems,  i.e.,  the  problems  of  optimizing  the  throughput  of  a  given 
configuration  with  respect  to  certain  adjustable  parameters  of  the 
operating  system,  without  changing  the  hardware  configuration  of  the 
system  (the  system  cost  is  held  constant).  In  particular,  this  section 
presents  the  result  of  performance  optimization  of  the  degree  of  multi¬ 
programming  and  then  proposes  a  promising  approach  to  the  page  size 
problem  (the  problem  of  determining  the  optimum  page  size). 

5.4.1.  Optimization  of  Multiprogramming  Algorithm 

Assuming  that  a  hardware  configuration  of  the  system  under  study  is 
given,  this  section  is  concerned  with  the  problem  of  optimizing  the 
degree  of  multiprogramming  in  such  a  way  that  the  throu0nput  of  this 
configuration  is  maximized  under  a  heavy  load.  Because  the  system 
throughput  is  linearly  proportional  to  the  percentile  throughput,  the 
above  problem  is  equivalent  to  that  of  maximizing  the  percentile  through¬ 
put  by  choosing  the  optimum  degree  of  multiprogramming. 

As  suggested  in  Section  3.5.,  it  is  reasonable  to  expect  that  the 
percentile  throughput  as  a  function  of  the  degree  of  multiprogramming 
shows  a  uni-model  curve  like  the  one  shown  in  Figure  3-6;  if  the  degree 
of  multiprogramming  is  too  small  the  multiprogramming  idle  time  would 
dominate  the  processor  time  breakdown  and  if  it  is  too  large  the  paging 
overhead  operations  could  dominate  the  processor  time  breakdown.  We 
will  investigate  the  performance  of  a  particular  configuration,  i.e., 
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a  two-processor  three-primary-memory-unit  configuration  with  the  follow¬ 
ing  characteristics. 


mean  page  fetch  time  (mpft)  =  35  msec 
mean  paging  overhead  time  (K^t^)  =  7  msec 
miscellaneous  overhead  coefficient  (6)  =  0.25 
memory  interference  coefficient  (y)  =  1.2 
missing-page  probability  (p)  =0.1 


We  assume  a  particular  linear  paging  behavior  of  programs  obtained  in 
Table  2-2  of  Chapter  2.  All  of  the  three  cases  concerning  sharing 
included  in  Table  2-2  are  considered.  Those  three  cases  are: 


Case  1:  no  sharing  of  non-resident  programs 

(a  =  0 . 1,  b  =  0) 

Case  II:  10  percent  sharing  of  non-resident  programs 
(a  =  0 . 1,  b  =  0.1) 

Cast  III:  20  percent  sharing  of  non-resident  programs 
(a  =  0 . 1,  b  =  0.2) 

Changing  the  degree  (q)  of  multiprogramming,  the  processor  time  break¬ 
down  of  the  above  configuration  was  repeatedly  computed  using  the 
throughput  analysis  program,  for  each  of  these  three  cases  concerning 
sharing . 

The  result  is  summarized  in  Table  5-6.  It  is  clearly  seen  in  each 
case  that  the  percentile  throughput  as  a  function  of  q  shows  a  uni-model 
curve.  The  multiprogramming  idle  time  and  the  paging  overhead  time  are 
the  two  major  components  determining  this  uni-modal  tendency  of  the 
percentile  throughput.  For  case  I,  the  optimum  degree  of  multi- 
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Table  5-6  Optimization  of  Percentile  Throughput  of  a  Lual 
Processor  Configuration 

Case  I:  No  Sharing  of  Non-Resident  Programs 


degree  (q)  of 
multiprogramming 

2 

3 

4 

© 

6 

7 

8 

(1) 

7.6 

12.6 

17.3 

21.5 

25.2 

28.4 

31.1 

(2) 

10.3 

11.7 

12.4 

12.6 

12.6 

12.4 

12.1 

(3) 

38.1 

24.6 

15.9 

10.0 

6.0 

3.5 

1.9 

(4) 

2.4 

3.7 

4.6 

5.2 

5.6 

5.8 

6.0 

(5) 

41.4 

47.1 

49.6 

[mi 

50.4 

49.7 

48.7 

Case  II:  10  Percent  Sharing  of  Non-Resident  Programs 


degree  (q)  of 
multiprogramming 

2 

3 

4 

5 

© 

7 

8 

(1) 

7.4 

12.0 

16.0 

19.3 

22.0 

24.0 

25.7 

(2) 

10.5 

12.2 

13.1 

13.5 

13.6 

13.6 

13.4 

(3) 

37.1 

22.6 

13.2 

7.3 

3.7 

1.7 

0.7 

(4) 

2.4 

3.9 

4.8 

5.4 

.  5.8 

6.0 

6.1 

(5) 

42.3 

49.1 

52.6 

54.2 

f54.6l 

54.4 

53.8 

Case  111:20  Percent  Sharing  of  Non-Resident  Programs 


degree  (q)  of 
multiprogramming 

2 

3 

4 

5 

6 

© 

8 

(1) 

7.2 

11.3 

14.7 

17.1 

18.9 

20.1 

21.0 

(2) 

10.8 

12.8 

13.8 

14.3 

14.5 

14.5 

14.4 

(3) 

36.0 

20.5 

10.8 

5.1 

2.2 

0.8 

0.2 

(4) 

2.5 

4.1 

5.1 

5.6 

6.0 

6.1 

6.2 

(5) 

43.3 

51.2 

55.4 

57.5 

58.2 

"58.  J1 

57.9 

i 


□pft  =  35  msec,  mean  paging  overhead  time  (K.t  )  =  7  msec,  6  =  0.25  v-1  2 

p  =  0. 1 . 

processor  time  breakdown:  (1)  Paging  Overhead,  (2)  Miscellaneous  Over- 
head,  (3)  Multiprogramming  Idle,  (4)  Memory  Interference  Idle,  (5)  Users' 
Useful  Computation  (all  in  percent). 
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programming  is  obtained  to  be  five  (q* = 5)  with  the  maximized  percentile 
throughput  amounting  to  50.5  percent  (0 =0.505).  It  is  seen  that  the 
percentile  throughput  can  be  improved  by  nearly  10  percent  50. 5-41. 4) 

by  using  the  optimum  multiprogramming  mechanism  instead  of  the  uni¬ 
programming  mechanism  (i.e.,  q  =  2).  By  comparing  the  results  of 
the  above  three  cases  concerning  sharing,  the  effect  of  sharing  upon 
the  optimized  percentile  throughput  is  clearly  seen;  the  gain  due  to 
twenty-percent  sharing  is  approximately  8  percent  in  absolute  percentile 
throughput.  It  is  also  observed  that  as  the  degree  of  sharing  increases 
the  optimum  degree  of  multiprogramming  tends  to  increase.  This  tendency 
is  due  to  the  fact  that  the  larger  degree  of  multiprogramming  will  not 
shorter  the  mtbpf  of  the  eligible  user  processes  too  much  if  there 
exists  a  considerable  amount  of  sharing  among  these  processes.  Thus, 
the  optimum  degree  of  multiprogramming  for  the  above  two-processor 
configuration  was  found  to  be  five  to  seven,  for  the  assumed  program 
behavior.  This  result  is  surprisingly  cons  stent,  with  the  fact  that 
the  Multics  system  with  the  same  hardware  configuration  is  tuned  to 
t.llow  five  to  six  user  processes  to  be  eligible  with  its  dynamic  multi¬ 
programming  algorithm. 

Finally,  in  order  co  examine  the  effect  of  the  number  of  processors 
upon  the  optimization  of  the  multiprogramming  algorithm,  the  performance 
of  a  single  processor  configuration  was  evaluated.  The  system  charac¬ 
teristics  were  similarly  specified  as  mpft  =  35  msec,  6  =  0.25,  Y=1.2, 
and  t  t  6  msec  .  The  same  linear  paging  behavior  of  programs, 
with  a=b=0.1  (ten-percent  sharing),  wr.s  assumed.  The  result  is 
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summarized  in  Table  5-7.  It  is  seen  that  the  optimized  percentile 
throughput  amounting  to  65.5  percent  is  attained  when  the  degree  of 
multiprogramming  is  chos,n  to  be  three.  Comparing  this  result  with  the 
perform  nee  of  the  dual  processor  configuration  that  we  have  considered, 
we  find  that  an  addition  of  a  processor  to  this  single  processor  con¬ 
figuration  would  make  the  system  throughput  1.67  (  =  2  x0.546  /0.655) 
times  as  large  as  that  of  the  original  single  processor  configuration; 
apparently,  the  system  throughput  cannot  be  doubled  because  there  exists 
multi-processor  interference  (note  that  the  size  of  primary  memory  is 
384  k words  for  both  configurations.).  All  of  these  observations  are 
again  surprisingly  consistent  with  the  actual  measurement  results  of  the 
Multics  system  both  qualitatively  and  quantitatively. 

However,  a  comment  is  in  order  about  the  optimum  degree  of  multi¬ 
programming.  A  considerable  difference  is  seen  between  the  optimum 
degrees  of  multiprogramming  of  the  two  corresponding  (single  processor 
and  dual  processor)  configurations.  This  means  that  the  number  of 
processors  plays  an  important  role  in  optimizing  the  multiprogramming 
algorithm.  On  the  other  hand,  the  current  Multics  operating  system 
determines  the  degree  of  multiprogramming  using  only  a  (crude)  working- 
set  estimate  cf  user  processes  against  the  available  size  of  primary 
memory;  the  Multics  system  is  currently  tuned  to  allow  as  many  as  five 
or  six  user  processes  under  multiprogramming  on  both  the  large-scale  and 
the  medium-scale  configurations.  (Note  that  both  configurations  have 
the  same  amount  of  primary  memory.)  Therefore,  our  observation  about 
the  role  of  the  number  of  processors  in  optimizing  the  system  throughput 
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suggests  a  possible  change  of  the  Multics  multiprogramming  algorithm 
such  that  the  number  of  processors,  i.e.,  a  source  of  raw  computing 
power,  is  also  considered  in  its  working-set  strategy  in  determining  the 
degree  of  multiprogramming. 

Thus,  we  have  evaluated  the  percentile  throughput  of  a  given  hard¬ 
ware  configuration,  for  each  degree  of  multiprogramming,  in  this  section. 
This  result  makes  it  possible  to  derive  the  response  time  distribution 
of  this  configuration,  as  the  next  step.  In  fact,  we  have  already 
evaluated  the  response  time  distribution  of  the  optimized  single 
processor  configuration  that  we  have  just  analyzed,  earl  .er  in 
Section  5.3.2.  (See  Figures  5-3,  5-4,  and  5-5.) 


5.4.2.  Optimization  of  Page  Size 

In  this  section,  we  wi!1  consider  the  page  size  problem,  i.e.,  the 
problem  of  determining  the  optimum  page  size.  This  problem  is  well 
discussed  in  Denning's  tutorial  paper  on  virtual  memory  [D2].  The 
author  claims  that  a  choice  of  page  size  should  be  made  considering 
memory  fragmentation  and  efficiency  of  page- transfer  operations  between 
primary  and  secondary  memories.  Consideration  of  memory  fragmentation 
suggests  the  use  of  small  page  size  (e.g.,  45  words)  and  that  of  page- 
transfer  efficiency  suggests  the  use  of  large  page  size  (e.g.,  10  -  10* 
words).  The  author  however  does  not  present  any  method  to  find  a  trade¬ 
off  Detween  these  two  conflicting  factors  which  may  give  the  optimum 
page  size.  It  is  apparent  that  there  existed  no  vehicle  to  accomodate 


229 


these  two  conflicting  factors  in  a  unifying  framework  which  allows  <e 
determination  of  the  optimum  page  size.  However,  the  framework  of 
performance  evaluation  developed  in  this  thesis  fortunately  provides  js 
with  such  a  vehicle. 

We  assume  that  the  configuration  of  the  system  is  completely 
specified  except  the  page  size,  and  attempt  to  evaluate  the  effect  of 
page  size  upon  the  percentile  throughput  of  this  system  so  that  we  can 
derive  the  page  size  which  maximizes  the  throughput  of  the  system  under 
study. 

Two  types  or  memory  fragm  .atlon  exist  on  a  virtual  memory  comput¬ 
er  system  using  paging:  internal  fragmentation  and  table  fragmentation. 
The  for*  Dr  represents  the  wasted  space  in  the  last  page  of  each  segment 
(the  memory  requirement  of  each  segment  must  be  rounded  up  to  an 
integral  number  of  pages)  and  the  latter  represents  the  space  required 
by  storing  the  page  table  in  the  area  of  primary  memory  reserved  for 
the  resident  supervisor.  Memory  fragmentation  reduces  the  size  of 
primary  memory  available  to  non-resident  programs  (see  Figure  2-1),  and 
therefore  paging  activities  of  user  processes  may  be  intensified.  Given 
the  page  size,  it  is  straightforward  to  estimate  the  total  wastage  of 
primary  memory  space  due  to  both  types  of  memory  fragmentation. 

The  effect  of  page  size  upon  page-transfer  operations  can  be 
measured  in  the  page  fetch  time  associated  with  each  secondary  memory 
device.  For  rotating  devices  like  drums  and  disks  the  effect  of  page 
size  upon  the  page  fetch  time  is  rather  slight  because  of  their  relative¬ 
ly  long  access  time,  but  for  a  bulk  core  memory  used  as  secondary  memory 
the  page  fetch  time  is  linearly  proportional  to  the  page  size  [D2], 
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On  the  other  hand,  it  is  clear  from  the  result  of  Section  2.3  of 
Chapter  2  that  programs'  paging  behavior  is  also  affected  by  the  page 
size.  Therefore,  we  must  consider  the  effect  of  page  size  upon  the 
mhbpf  (or  mtbpf)  of  user  processes,  besides  those  two  factors  mentioned 
by  Denning.  It  was  seen  in  Section  2.3  that  the  smaller  page  size 
yields  a  longer  mhbpf  in  a  program's  steady-state  behavior.  However,  in 
its  transienc-state,  i.e.,  while  the  program  has  not  yet  fetched  pages 
that  are  necessary  for  a  sound  progress  of  its  computation  into  .primary 
memory,  the  smaller  page  size  would  yield  a  shorter  mhbpf.  For  example, 
if  we  measure  the  mhbpf  of  a  process  during  the  period  which  terminates 
at  the  moment  when  all  the  page- frames  available  to  the  process  become 
occupied  ty  its  pages  in  a  partitioned  primary  memory,  then  the  mhbpf  on 
the  system  with  the  halved  page  size  would  be  roughly  half  of  the  mhbpf 
on  the  original  computer  system.  This  argument  suggests  that  the  larger 
page  size  would  be  favorable  to  programs  with  relatively  short  execution 
times  which  can  operate  comfortably  in  a  relatively  large  primary  memory 
space;  on  the  other  hand,  the  smaller  page  size  would  be  favorable  to 
programs  with  relatively  long  execution  times  which  must  operate  within 
a  relatively  small  primary  memory  space. 

Unfortunatelv,  none  of  the  program  models  developed  in  this  thesis 
can  quantitatively  evaluate  the  effect  of  page  size  upon  the  overall 
mhbpf  of  user  processes.  The  experimental  result  obtained  by  Baer  [Bl] 
indicates  that  mhbpf  is  maximum  wnen  the  page  size  is  128  to  256  words. 
However,  interpretation  of  "numbers"  obtained  in  experimental  studies  is 
(at  best)  very  tricky  because  all  the  conditions  of  these  studies  are 
not  explicit;  it  is  much  more  difficult  to  derive  a  parameterized 
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expression  of  mhbpf  (which  can  be  applied  to  any  other  system)  as  a 
function  of  page  size,  memory  size,  and  program  characteristics,  from 
the  experimental  studies.  Therefore,  a  comprehensive  analytical  model 
is  much  desired  in  this  field.  In  this  respect,  a  program  behavior 
model  of  Woolf  [W3]  is  worthy  of  attention,  but  the  validity  of  this 
model  has  not  been  examined. 

Thus,  if  the  overall  mhbpf  (or  mtbpf)  can  be  properly  evaluated  as 
a  function  of  those  system  parameters,  we  can  evaluate  the  effect  of 
page  size  upon  the  percentile  throughput  of  the  processing  system, 
using  the  analysis  results  of  mpft  (of  secondary  memory)  and  memory 
fragmentation  as  well  as  that  of  the  overall  mhbpf  of  user  processes. 
Therefore,  it  would  be  possible  to  determine  the  page  size  which  maxi¬ 
mizes  the  throughput  of  the  computer  system  under  study. 
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5.5.  Configuration  Selarcirm 

Configuration  selection  is  the  problem  of  selecting  the  configure- 
tion  whrch  attains  the  best  performance  for  a  given  (purchase  or  rental) 
budget.  By  now,  a  solution  to  this  problem  may  be  clear. 

The  process  of  selecting  the  best  performanc.  configuration  within 
a  given  budget  is  schematically  depicted  in  Figure  5-8.  This  process 
essentially  contains  the  following  three  stages: 

(1)  Choice  of  a  hardware  configuration 

(2)  Performance  optimization  for  this  configuration 

(3)  Application  >f  a  decision  rule  to  the  cost-performance  of 
each  configuration 

In  the  first  stage,  a  particular  hardware  configuration  is  chosen  from 
a  finite  set  of  possible  hardware  configurations  of  the  processing 
system,  as  a  candidate  for  the  best  configuration.  Different  hardware 
configurations  represent  different  number  of  hardware  components 
(processors  and  primary  memory  units),  different  site  of  primary  memory, 
different  speed  of  hardware  components  (processors,  primary  memory,  and 
secondary  memory)  and  so  on.  In  the  second  stage,  the  percentile 
throughput  of  the  chosen  hardware  configuration  is  optimized  with 
respect  to  certain  adjustable  parameters  of  the  operating  system  (e.g., 
the  degree  of  multiprogramming,  the  page  site,  etc.).  The  optimited 
(hardware-software)  configuration  as  well  as  its  performance  is  recorded 
as  the  best  performance  which  can  be  obtained  from  that  hardware  con¬ 
figuration.  Repeating  these  two  stages  for  each  hardware  configuration 
whose  cost  is  within  the  budget,  we  can  obtain  the  maximised  performance 
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for  each  candidate  hardware  configuration  of  the  processing  system. 

When  all  the  candidate  configurations  of  the  processing  system  are 
evaluated  in  the  above  optimization  stage,  the  result  is  passed  over  to 
the  third  and  last  stage  of  the  configuration  selection  process.  In 
this  stage,  the  best  configuration  is  selected  according  to  a  decision 
maker's  formula  concerning  the  cost-performance  of  the  system;  the 
decision  maker  may  use  a  simple  ratio  of  cost  and  performance  or  his  own 
complicated  formula  concerning  cost  and  performance.  This  decision 
process  is  entirely  up  to  the  decision  maker.  When  he  has  selected  a 
particular  configuration  in  this  way,  he  obtains  the  best  configuration 
of  the  system  under  study  which  is  within  a  given  budget. 


CHAPTER  6 


CONCLUSIONS 


Modern  large-scale  time-shared  computer  systems  have  become  so 
complicated  in  their  structure  and  performance  that  human  intuition 
often  cannot  foresee  the  impact  of  a  small  change  in  their  structure 
upon  their  performance.  This  thesis  has  focused  its  attention  on  the 
statistical  performance  of  these  systems;  the  system  throughput  and  the 
system  response  time  have  been  selected  as  the  performance  measures  of 
these  systems.  Architects  and  designers  of  these  computer  systems  have 
encountered  many  performance  questions  that  they  found  very  difficult  to 
answer  quantitatively  (some  examrles  are  shown  in  Table  1-1).  They  did 
not  have  a  useful  tool  to  tackle  these  performance  questions;  they 
definitely  needed  a  comprehensive  structured  framework  of  performance 
evaluation  which  serves  as  an  aid  to  guide  their  intuition  in  understand¬ 
ing  variety  of  performance  evaluation  problems;  they  wanted  a  methodology 
of  performance  evaluation  which  can  quantitatively  answer  performance 
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questions  such  as  those  given  in  Table  1-1. 

For  variety  of  good  reasons,  we  have  decided  to  explore  the  possi¬ 
bility  of  using  analytical  models  in  attacking  these  performance  evalua¬ 
tion  problems  earlier  in  this  thesis.  Because  these  modern  large-scale 
time-shared  computer  systems  involve  many  important  features  which  are 
believed  to  influence  their  statistical  performance  significantly,  we 
faced  a  dilemma  of  two  basically  conflicting  factors  concerning  analyti¬ 
cal  models  of  these  computer  systems:  a  multiplicity  of  important  system 
parameters  and  the  mathematical  tractability  of  these  models.  As  a 
solution  to  this  dilemma,  we  have  presented  an  approach  using  a  set  of 
hierarchically  organized  modular  models  (see  Figure  1-4);  by  developing 
a  model  for  each  subsystem  of  the  entire  computer  system  and  then 
combining  these  models  together  through  their  inter-relationships,  we 
can  realistically  model  actual  modern  computer  systems.  In  this 
approach,  we  can  evaluate  the  effect  of  a  small  change  in  one  subsystem 
(e.g.,  secondary  memory)  upon  the  overall  performance  measures  of  these 
systems  (throughput  and  response  time  distribution);  we  can  consider  all 
the  major  features  of  these  modern  computer  systems  such  as  paging, 
segmentation,  multiprogramming,  multi-processing,  memory  hierarchy,  etc.; 
we  can  consider  several  kinds  of  system  overhead  times  and  idle  times 
which  reduce  the  computational  capacity  of  these  computer  systems.  In 
modeling  these  modem  computer  systems,  we  have  however  abstracted  the 
actual  structure  of  these  systems  in  a  certain  way;  because  this  thesis 
is  concerned  with  how  a  system  configuration  (concerning  hardware, 
system  programs,  user  programs,  and  users)  affects  its  overall  perfor¬ 
mance,  only  the  system  parameters  which  are  believed  to  influence  the 


system  throughput  and  the  average  response;  time  were  considered  in 
tackling  performance  projection  problems  (the  problems  of  estimating  the 
performance  of  a  system  which  does  not  yet  exist)  of  these  computer 
systems. 

Performance  projection  problems  were  classified  into  (1)  perfor¬ 
mance  prediction  problems,  (2)  performance  optimization  problems,  and 
(3)  configuration  selection  problems,  in  this  thesis.  In  attacking  the 
performance  prediction  problems  of  these  computer  systems  with  an 
approach  using  a  set  of  hierarchically  organized  modular  models,  the 
thesis  has  proposed  a  particular  hierarchy  of  modular  models  depicted  in 
Figure  1-4.  The  hierarchy  contains  (1)  a  user  behavior  model,  (2)  a 
secondary  memory  model,  (3)  a  program  behavior  model,  (4)  a  processor 
model,  and  (5)  a  total  system  model.  Because  the  last  three  models  were 
particularly  felt  to  be  underdeveloped,  this  thesis  developed  stochastic 
models  in  these  three  areas.  Chapter  2  described  several  program 
behavior  models  which  evaluate  the  effect  of  various  important  system 
parameters  affecting  a  program's  paging  behavior  (measured  in  the  mean 
time  between  page  faults).  In  praticular,  the  macroscopic  paging 
performance  model  of  Section  2.5  turned  out  to  be  useful  in  the  sense 
that  the  analysis  result  of  this  model  can  be  easily  combined  into  the 
analyses  of  other  subsystems.  Chapter  3  presented  the  throughput 
analysis  of  the  multi-processor  multi-memory  processing  system,  i.e., 
the  derivation  of  the  processor  time  breakdown  for  a  processing  system 
of  a  given  configuration;  because  the  system  throughput  is  linearly 
proportional  to  the  percentile  throughput  of  the  system  (the  percentage 
of  the  system's  computational  capacity  used  for  users'  useful  work),  as 
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shown  in  Section  3.2,  the  system  throughput  can  be  directly  obtained 
from  the  result  of  the  processor  time  breakdown.  Chapter  4  then  proceed¬ 
ed  to  evaluate  the  response  time  characteristics  of  the  entire  computer 
system,  using  the  results  obtained  by  the  analyses  of  all  other  models 
in  the  hierarchy  of  models.  The  analysis  presented  in  this  chapter  has 
explicitly  derived  the  probability  distribution  of  response  time  of 
these  computer  systems;  from  this  result,  we  can  determine  the  number  of 
interactive  users  that  can  be  supported  by  the  system  of  a  given  configu¬ 
ration  with  an  assurance  that  the  90  percentile  response  time  is  less 
than  ten  seconds,  for  example. 

Finally  in  Chapter  5,  the  validity  of  the  processor  model  and  the 
total  system  model  was  examined  by  comparing  their  behavior  with  the 
behavior  of  an  actual  system,  i.e.,  the  Multics  system  of  M.l.T.  This 
comparison  has  shown  that  the  performance  predicted  by  these  analytical 
models  is  consistent  with  the  actual  performance  of  three  configurations 
of  the  Multics  system.  Then,  various  performance  prediction  problems 
and  performance  optimization  problems  such  as  those  mentioned  in  Table 
1-1  were  numerically  studied.  The  predicted  results  look  very  reason¬ 
able  and  the  result  of  optimization  concerning  the  degree  of  multi¬ 
programming  suggests  a  possible  change  of  the  current  Multics  multi¬ 
programming  algorithm.  At  the  end  of  Chapter  5,  the  configuration 
selection  problem  (the  problem  of  selecting  the  best  configuration \|or 
a  given  budget)  was  finally  considered. 

The  structured  framework  of  performance  evaluation  presented  in 
this  thesis  gives  system  analysts  a  vehicle  to  tackle  their  performance 
problems.  When  investigating  the  effect  of  a  certain  system  parameter 
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upon  the  performance  of  the  system  under  study,  a  general  approach 
suggested  by  this  thesis  research  is  to  examine  its  effect  on  all  the 
input  parameters  and  intermediate  performance  neasures  included  in  the 
hierarchy  of  models  shown  in  Figure  1-4;  system  analysts  can  then  make 
their  intuition  work  in  the  right  direction  in  figuring  out  the  effect 
of  that  system  parameter  upon  the  overall  system  performance.  A  good 
example  was  presented  '*1  Section  5.4.2,  when  we  considered  how  the 
effect  of  page  size  upon  the  throughput  of  the  system  under  investigation 
can  be  possibly  evaluated  in  an  effort  to  determine  the  optimum  page 
size  for  that  system. 

Many  numerical  results  concerning  processor  time  breakdown  and 
response  time  characteristics  of  the  computer  system  of  various  configu¬ 
rations  presented  in  Chapter  5  have  shown  that  a  set  of  analytical  models 
developed  in  this  thesis  is  capable  of  providing  reasonably  accurate 
answers  to  the  quantitative  performance  questions  with  which  computer 
architects  and  designers  must  cope;  these  models  can  answer  most  of  the 
basic  questions  concerning  the  throughput  and  the  response  time  of  multi- 
prograirmed  virtual-memory  time-shared  computer  systems  using  demand 
paging,  for  a  wide  range  of  system  configurations.  Therefore,  the 
author  of  this  thesis  believes  that  analytical  models  are  extremely 
useful  especially  in  tackling  the  performance  projection  problems  which 
computer  architects  and  designers  must  resolve  in  early  design  stages. 
Many  modern  large-scale  computer  systems  continue  to  evolve  both  in 
performance  and  in  supporting  hardware-software  structure.  For  these 
systems,  analytical  models  may  be  valuable  in  all  stages  of  their  life. 
However,  in  studying  some  of  the  detailed  problems  like  a  comparison  of 
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several  specific  resource  allocation  strategies  or  paging  performance  of 
computer  programs  of  acutal  systems,  simulation  approaches  would  be  more 
useful  in  practice.  Most  of  these  detailed  problems  usually  arise  later 
in  the  design  stage,  it  is  known  that  the  studies  of  these  detailed  problems 
in  simulation  approaches  tend  to  be  very  expensive.  If  this  is  the  case 
with  a  particular  problem,  an  analytical  approach  like  that  presented  in 
this  thesis  should  be  used  in  deriving  some  sub-optimal  solutions  to  the 
problem  under  investigation  which  are  to  be  studied  by  a  detailed 
simulation  approach,  in  order  to  reduce  the  operating  cost  of  the 
simulation  studies.  Measurement  of  actual  systems  is  almost  indispens¬ 
able  to  any  problem  as  a  source  of  information  verifying  the  expected 
performance  or  as  a  source  of  information  indicating  a  need  for  a  design 
change  of  the  computer  system  under  study. 

In  short,  the  approach  using  a  set  of  hierarchically  organized 
modular  models  is  usable  for  actual  performance  evaluation  problems. 

This  approach  guides  one's  inLuition  to  understand  the  cause-and-effect 
relationship  existing  in  a  complicated  structure  of  modern  large-scale 
computer  systems.  When  the  analyses  using  these  models  are  applied,  a 
quantitative  solution  with  a  reasonably  accurate  approximation  to  the 
performance  problem  concerning  system  configuration  can  be  obtained. 
Furthermore,  this  modular  modeling  approach  allows  one  to  use  partially 
available  information  (obtained  by  actual  measurement)  about  system 
performance  effectively  in  estimating  the  overall  performance  of  the 
system. 

Finally,  we  would  like  to  turn  our  attention  to  the  specific 
performance  problems  considered  in  this  thesis  to  find  the  problems 
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which  require  further  research.  In  the  area  of  program  behavior 
analysis,  it  is  generally  felt  that  more  measurements  of  paging  behavior 
of  actual  programs  are  needed.  Since  the  primary  memory  size  is  the 
single  most  influential  parameter  determining  the  paging  behavior  of 
programs,  more  measurement  examining  the  validity  of  the  linear  paging 
model  is  absolutely  necessary.  Measurement  of  sharing  is  virtually 
unexplored  and  needs  to  be  carried  out.  To  study  the  page  size  problem 
(the  problem  of  deriving  the  optimum  page  size)  fully,  we  need  a 
comprehensive  analytical  model  to  evaluate  the  effect  of  page  size  upon 
the  mean  time  between  page  faults,  as  discussed  in  Section  5.4.2.  Ir 
the  area  of  throughput  analysis,  it  was  observed  in  Chapter  3  that  the 
analysis  of  a  dual  processor  system  is  fairly  complicated  and  that  of  a 
many-processor  system  (e.g.,  a  ten-processor  system)  would  be  of 
staggering  complexity;  it  is  felt  t„at  we  must  somehow  develop  a  model 
of  many-processor  processing  systems  which  can  realistically  predict 
the  processor  time  breakdown  with  a  reasonably  amount  of  computation, 
because  we  see  the  current  trend  of  computer  design  moving  towards  many 
processor  large-scale  computer  systems.  Finally,  in  the  area  of 
response  time  analysis,  the  accuracy  of  percentile  response  time  should 
be  examined  more  throughly  against  actual  response  time  characteristics 
ot  real  computer  systems.  For  this  purpose,  more  data  concerning 
response  time  characteristics  of  actual  time-shared  computer  systems 

must  be  collected. 

Although  the  class  of  computer  system  configurations  (see  Figu~es 
1-1  and  1-2)  considered  in  this  thesis  is  fairly  general  in  structure, 
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it  is  almost  certain  that  w*  must  consider  other  classes  of  configura¬ 
tions  also  which  will  attract  more  attention  in  the  future.  For  example* 
distributed  (geographically  separated)  computers  may  become  more  attrac¬ 
tive  in  near  future  to  allow  reliability  and  scaling  up  of  size  (on  the 
order  of  more  than  ten).  A  constant  effort  must  be  made  to  develop 
modeling  techniques  which  can  realistically  evaluate  the  performance  of 
computer  systems  of  today  and  the  future. 


APPENDIX  A  THROUGHPUT  ANALYSIS  PROGRAM 


This  appendix  gives  an  explanation  of  the  throughput  analysis 
program  which  was  extensively  used  in  deriving  the  processor  tine 
breakdown  for  various  multi-processor  multi-memory  processing  systems 
under  multiprogramming  in  Chapter  5.  The  program  is  entirely  based  on 
the  processor  model  of  Chapter  3  and  uses  particularly  Eqs.  (3.3.4) 
through  (3.3.6)  and  Eqs.  (3.4.6)  through  (3.4.22).  The  source  program 
written  in  PL/1  and  a  sample  console  session  involving  analyses  of  a 
single  processor  configuration  and  a  dual  processor  configuration  are 
also  included  in  this  appendix. 

In  using  the  throughput  analysis  program,  the  configuration  of  a 
processing  system  under  investigation  must  be  interactively  specified 
from  a  terminal  of  a  time-shared  computer  system  on  which  this  program 
is  executed.  Thir  program  needs  the  following  input  parameters1  which 
.pecify  the  system  configuration. 

1.  number  of  processers  (m  ;  ma  1  or  2) 

2.  number  of  primary  memory  units  (n) 

3.  degree  of  multiprogramming  (q) 

4.  mean  page  fetch  time  (mpft) 

5.  mean  length  of  in-page  operation  (t.  ) 

in 

6.  missing-page  probability  (p) 

7.  mean  paging  overhead  time  (K.t  .) 

X  pi 

1 

Performance  evaluation  of  a  single  processor  processing  system  does  not 
require  all  of  these  input  data. 
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8.  miscellaneous  overhead  coefficient  (6) 

9.  memory  cycle  interference  coefficient  (Y) 


A  few  comments  may  be  in  order  about  some  of  these  inputs.  In  specify¬ 
ing  the  values  of  the  mean  length  of  in-page  operation  (t^  )  and  the 
missing-page  probability  (p),  it  should  be  remembered  that  these 
variables  automatically  determines  the  mean  time  between  page  faults, 
mtbpf  (1  CPU),  as  follows. 

mtbpf  (1  CPU)  =  £ in 

P 


It  has  been  found  through  the  experience  of  using  this  program  that  the 
resulting  performance  is  rather  insensitive  to  a  particular  set  of 
values  of  t  and  p  if  their  ratio,  i.e.,  mtbpf  (1  CPU),  is  constant. 
Therefore,  after  the  value  of  mtbpf  (1  CPJ)  is  carefully  selected, 
these  two  particular  values  can  be  fairly  arbitrarily  determined.  The 
input  representing  the  mean  paging  overhead  time  reflects  a  slow-down 
factor  concerning  data-base  lockout.  Therefore,  should  be  selected 
in  such  a  way  that 


k*c  ; 


=  1  for  a  single  processor  system  (m=l), 
for  a  dual  processor  system  (m  =  2). 

On  the  Multics  system,  the  value  of  is  approximately  1.5  for  m  =  2. 


The  typical  value  of  the  memory  cycle  interference  coefficient  of  the 
Multics  system  is  found  to  be  Y^l.20.  The  value  of  the  miscellaneous 
overhead  coefficient  of  the  same  system  is  typically  5^0.25. 

Upon  receipt  of  these  parameters,  the  program  proceeds  to  derive 


and  print  out  the  processor  Lime  breakdown  (into  various  system  over¬ 
head  times,  idle  times,  and  users'  useful  computation)  and  the  relative 
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processor  speed  (the  value  of  the  effective  slow-down  factor  due  to 
memory  cycle  interference,  K.  ,  for  a  particular  configuration  under 
study).  The  program  also  prints  out  the  steady-state  probabilities  of 
the  states  of  the  model  (for  the  definition  of  these  states,  see  Table 
3-2  or  Table  3-4),  as  an  additional  information.  The  sample  console 
session  given  in  Section  A. II.  more  concretely  demonstrates  how  the 
throughput  analysis  program  should  be  used.  The  lines  typed  by  a  user 
of  the  throughput  analysis  program  are  underlined  and  the  lined  typed 
by  the  system  are  not. 

The  throughput  analysis  program  is  written  in  PL/I,  as  shown  in 
Section  A. I.  Therefore,  this  program  can  be  transferred  to  other 
systems  after  only  a  few  slight  modifications;  if  the  program  is  to  be 
transferred  to  an  IBM  system,  ’’call  ioa_"  (or  ''call  ioa_$nnl")  and  "call 
read_list_"  statements  contained  in  the  program  must  be  replaced 
respectively  by  "put  list"  and  "get  list"  statements.  An  external 
routine,  "mlsq",  invoked  by  the  program  solves  a  system  of  linear 
simultaneous  equations,  and  is  documented  in  "System/360  Scientific 
Subroutine  Package  (PL/l)---  Program  Description  and  Operations  Manual" 
(H20-0586-0)  published  by  IBM. 


Source  Program 
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A. II .  A  Sample  Console  Session 


thruput: 


******************************************************* 
*  THROUGHPUT  ANALYSIS  * 

******************************************************* 

number  of  processors  =  1 

*w 

degree  of  multiprogramming  =  4 
mean  page  fetch  time  (in  msec)  =  35 

•v»v 

mean  time  between  page  faults  (in  msec)  =  3^ 

mean  paging  overhead  time  (in  msec)  *  6.1 

miscellaneous  overhead  coefficient  =  0.25 

paging  overhead  =.138411580 

miscellaneous  overhead  =.155476615 

multiprogramming  idle  =.0242053359 

memory  cycle  idle  =0. 

percentile  thruput  =.621906459 


***  Additional  Information  *** 


Qp 

CPU 

SM 

pi  (  0, 

0, 

4)  = 

.0242053359 

pi  (  0, 

1/ 

3)  = 

.08298^7244 

pi  (  1/ 

1, 

2)  = 

.213402154 

pi  (  2, 

1/ 

1)  = 

.365832269 

pi  (  3, 

1, 

0)  = 

.313570518 

sum  of 

pi 

(..  .  ) 

=  .993999993 

r  2243 

2 

.796 

58  +  29 
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t  h  r  u  pu  t 


*  THROUGHPUT  ANALYSIS  * 


number  of  processors  =  2 

number  of  primary  memory  units  =  3 

degree  of  multiprogramming  =  5 

.•\v\ 

mean  page  fetch  time  (in  msec)  =  30^.9 

mean  length  of  in-page  operation  (in  msec)  =  3.19 

missing-page  probability  =  Q.l 

mean  paging  overhead  time  (in  msec)  =  9.0 

miscellaneous  overhead  coefficient  =  0.25 


memory  cycle  interference  coefficient 

paging  overhead 
miscellaneous  overhead 
multiprogramming  idle 
memory  interference  idle 
percentile  thruput 


1^20 


=.251299866 

=.127883710 

=.0524190464 

=.0568625187 

=.511534840 


processor  slow-down  factor 

due  to  memory  interference  =1.07200967 


**  * 

Add  i  1 

t  i  ona  1 

Qp 

CPU 

SM 

Ml 

pi  ( 

3, 

2/ 

o. 

0) 

Pi  ( 

3/ 

2/ 

o. 

1) 

pi  ( 

2, 

2  / 

1, 

0) 

pi  ( 

2/ 

2  / 

1, 

1) 

pi  ( 

1, 

2/ 

2, 

0) 

pi  ( 

1, 

2/ 

2, 

1) 

Pi  ( 

o# 

2, 

3  / 

0) 

pi  ( 

o. 

2/ 

3/ 

1) 

pi  ( 

0, 

1, 

4, 

0) 

pi  ( 

0, 

0, 

5, 

0) 

Information  *** 


=  .103385237 
=  .0632515093 
=  .188467521 
=  .113935644 
=  .171850422 
=  .102678935 
=  .104922063 
=  .0613090484 
=  .0755611220 
=  .0146334857 


r  2  245 


7.683 
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APPENDIX  D  RESPONSE  TIME  ANALYSIS  PROGRAM 


Ihis  appendix  gives  an  explanation  of  the  response  time  analysis 
program  which  was  extensively  used  in  evaluating  the  response  time 
characteristics  for  various  systems  in  Chapter  5.  The  program  is 
entirely  based  on  the  total  system  model  of  Chapter  4  and  uses  partic¬ 
ularly  Eqs .  (4.^.2)  through  (4.3.4)  and  Eqs.  (4.3.9),  (4.3.19),  and 
(4.3.25).  The  source  program  written  in  PL/I  and  a  sample  console 
session  involving  an  analysis  of  a  single  processor  configuration  are 
also  included  in  this  appendix. 

The  response  time  analysis  program  has  two  phases;  the  first  phase 
which  estimates  the  effective  percentile  throughput  and  the  second 
phase  which  predicts  the  response  time  distribution.  In  using  this 
program,  the  configuration  of  the  system  under  study  must  be  specified 
from  a  terminal  of  a  time-shared  computer  system  on  which  this  program 
is  executed.  In  particular,  the  first  phase  of  this  program  requires 
the  following  input  parameters  which  specify  the  system  configuration. 

1.  number  of  processors  (m) 

2.  optimum  degree  of  multiprogramming  (q*) 

3.  percentile  throughput  (0^)  for  each  degree  (l£q*q*)  of 
multiprogramming 

4.  number  of  users  (N) 

5.  average  execution  time  of  a  user  job  (  Tg) 

6.  average  think  time  of  a  user  (Tt) 

Upon  specification  of  the  system  configuration,  the  program  proceeds  to 
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derive  the  effective  percentile  throughput  of  this  configuration.  The 
computed  result  ot  the  effective  percentile  throughput  is  printed  out 
as  a  console  output.  The  estimated  steady-state  probabilities  of  the 
states  of  the  model  (see  Figure  4-4,  for  the  definition  of  these  states) 
are  also  printed  out,  as  an  additional  Information. 

The  second  phase  of  this  program  requests  the  following  input 
parameters. 

1.  number  of  processors  (m) 

2.  effective  percentile  throughput  (0(N)) 

3.  number  of  users  (N) 

4.  average  execution  time  of  a  user  job  (T^) 

5.  average  think  time  of  a  user  (Tt) 

Upon  supply  of  these  inputs,  the  program  proceeds  to  evaluate  the 
response  time  distribution.  It  prints  out  both  the  evaluated  probability 
density  and  the  evaluated  probability  distribution  function  (i.e., 
cumulative  function)  of  the  system  response  time.  The  sample  console 
session  given  in  Section  B.II.  more  concretely  demonstrates  how  the 
response  time  analysis  program  should  be  used.  The  linei  typed  by  a 
user  of  this  program  are  underlined  and  the  lines  typed  by  the  system 
are  not. 

The  comment  on  program  transferability  of  the  throughput  analysis 
program  is  appropriate  also  for  this  program. 
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resp 


B.II. 


A  Sample  Console  Ses s ion 


**********1 


RESPONSE  T I  ME  ANALYSIS 


Do  you  want 
throughput? 


to  determine  the  effective  percentile 

-  type  yes  or  no  - 

yes 


***  Estimation  of  Effective  Percentile  Throughput 
number  of  processors  =  1 


*** 


degree  of  multiprogramming  =  4 


percentile  throughput  (th) 
mult i programming 


in  %  for  each  degree  (q)  of 


if  q  =  1  then  th  =  55 
if  q=2  then  th  ■  63 
if  q=3  then  th  «  65 
if  q=4  then  th  =  65' 

V-**  v 

number  of  users  =  30 

average  execution  time  of  a  job  in  seconds  =  0.4 
average  user  think  time  in  seconds  =  20 

iteration  estimate  of  effective  %  throughput 

1  .649999999 

2  .626669958 

3  .629028954 


effective  percentile 
average  queue  length 
average  response  tine 


throughput  =  6.29028954e-01 

3 . 5 2 0 3 1 3 8 0 p  + 0 0  jobs 

2 . 6 5 8 8 7 8 7 7e+00  seconds 
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***  Additional  Information  *** 


Pi  ( 0 )  =  .158077382 
Pi  (1)  ■  .150782293 
Pi(2)  =  .139029734 
pi  (3  )  =  .  123772760 
P<  (4  )  =  .106254710 
Pi(5)  =  .0878376858 
Pi (6)  =  .0698200641 
Pi(7)  =  .0532783591 
P i ( 8 )  =  .0389617118 
Pi (9)  =  .0272533605 
P f  CIO)  =  .0181969544 
Pi(ll)  =  .0115714571 
pi (12)  =  .00699038355 
Pi ( 13 )  =  .00400067121 
P i ( 14 )  =  .00216242540 
Pi (15 )  =  .00110007037 
Pi(10)  =  .000524651696 
Pi (17)  =  .000233538489 
P i ( 18  )  =  . 96529748e-04 

Pi ( 19  )  =  . 36830004e-04 

P  i  ( 20  )  =  .  12881125e-04 

P  i  ( 21 )  =  .  40955587e-05 

Pi  (22)  =  . 11719660e-05 

Pi ( 23)  =  . 29810163e-06 

Pi  (24 )  =  .66347070e-07 

pi  (25)  =  . 12657046e-07 

Pi  (26)  =  .  20121563e-08 

Pi  (27)  =  .  25590634e-09 

pi  (28)  =  .  24409656e-10 

P i ( 29 )  =  .  155  22119e-ll 

Pi (30 )  =  . 49352636e-13 

sum  of  p(i)  is  1.00000000 
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Do  you  want 


to  derive  the  response  time  distribution’ 

-  type  yes  or  no  - 

yes 


Derivation  of  Response  Time  Distribution  *** 
number  of  processors  =  1' 
effective  percentile  throughput  in  =62.90 
number  of  users  =  30 


average  execution  time  of  a  job  in  seconds  =0.4 
average  user  think  time  in  seconds  =20 


t  ime 


dens  i  ty 


cumu lat i ve 


t  -  0.0  p  = 

t  =  .5  p  = 

t  =  1.0  p  = 

t  =  1.5  p  = 

t  =  2.0  p  = 

t  =  2.5  p  = 

t  =  3.0  p  = 

t  =  3.5  p  = 

t  =  4.0  p  = 

t  =  4.5  p  = 

t  =  5.0  p  = 

t  =  5.5  p  = 

t  =  6.0  p  = 

t  =  6.5  p  = 

t  -  7.0  p  = 

t  =  7.5  p  = 

t  =  8.0  p  = 

average  response 


.281582188 

.262502369 

.240528617 

.216799615 

.192369802 

.168154361 

. 144896636 

.123155845 

.103311582 

.085580618 

.070041458 

.056662548 

.045330858 

.035873406 

.028105280 

.021798311 

.016745327 

time  =  2.6 


cum  = 

0.000000000 

cum  = 

.136165990 

cum  = 

.262020107 

cum  = 

.376402460 

cum  = 

.478703693 

cum  = 

.568808913 

cum  = 

.647018701 

cum  = 

.713959388 

cum  = 

.770491533 

cum  = 

.817623936 

cum  = 

.856438048 

cum  = 

.883025880 

cum  = 

.913442105 

cum  = 

.933670118 

cum  = 

.949600466 

cum  = 

.962019749 

cum  = 

.971667707 

923250 

seconds 

r  2119  9.451  68+60 


NOTATION 


Following  is  a  partial  list  of  the  symbols  used  in  this  thesis. 
Only  those  symbols  which  have  a  global  meaning  throughout  the  text  are 
listed  here  alphabetically. 


a  -  degree  of  sharing  of  resident  programs, 
b  -  degree  of  sharing  of  non-resident  programs, 
bj  -  j-th  block  of  a  program, 
fj  -  j-th  page- frame  of  primary  memory, 
m  -  size  of  primary  memory  space  (Chapter  2). 
number  of  processors  (Chapters  3  -5) . 
mf  -  size  of  primary  memory  space  reserved  for  resident 
supervisor  programs. 

m^  -  size  of  primary  memory  space  reserved  for  non-resident 
programs . 

u  -  size  of  a  program  (Chapter  2). 

number  of  primary  memory  units  (Chapters  3  -5). 


n  -  number  of  page  faults. 

P 

p  -  missing-page  probability. 


p^  -  i-th  page  of  a  program. 


p^j  -  conditional  probability  of  a  reference  to  a  page  p^,  given 
that  a  page  p^,  was  referenced, 
q  -  degree  of  multiprogramming. 


q*  -  optimum  degree  of  multiprogramming. 
rt  "  page  reference  at  time  t. 
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fc0  ”  un^  f line  of  a  discrete-time  Markov  process. 
tm  "  misceHaneous  overhead  time, 
t  -  paging  overhead  time. 

tu  "  headway  between  page  faults  (user's  useful  work  within  a 
tbpf)  . 

u  -  utilization  factor  of  a  processor. 

’  effective  slow  down  factor  of  the  processor  speed  due  to 
data-base  lockout. 

Km  ”  effective  slow-down  factor  of  the  processor  speed  due  to 
memory  cycle  interference. 

M  -  total  size  of  primary  memory. 

N  -  number  of  interactive  terminal  users. 

Ng  -  number  of  Markovian  states. 

P  -  set  of  pages  of  a  program. 

-  execution  time  of  j  user  job. 

-  response  time. 

Tt  -  user's  think  time. 

Tw  ”  waiting  time  in  the  job  queue, 

1/a  -  mtbpf 

1/0  -  mpft 

Y  -  memory  cycle  interference  coefficient. 

6  -  miscellaneous  overhead  coefficient. 

9  -  (effective)  percentile  throughput. 

0(N)  -  effective  percentile  throughput. 

Sq  -  percentile  throughput  of  the  system  under  multiprogramming 
of  degree  q  . 
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6  -  throughput. 

x  -  e<N)\0 

1Aq  -  average  execution  time  of  a  user  Job  (T^)  . 
1/p  -  average  think  time  of  a  user  (Tt) . 
n  -  steady-state  state  probability, 
p  -  pA 
o  -  a/p 


GLOSSARY 


This  glossary  is  a  partial  list  of  terminology  used  in  this  thesis 
and  is  intended  to  remind  readers  of  the  meaning  of  each  term  appearing 
throughout  this  thesis. 

data-base  lockout:  multi-processor  interference  caused  by 

unavailability  of  a  shared  writable  data-base  which  cannot  allow 
simultaneous  accesses  of  multiple  processors, 
data-base  lockout  idle  time:  the  processor  idle  time  caused  by  data¬ 
base  lockout. 

effective  percentile  throughput:  the  average  of  percentile  throughput 
(see  Eq.  (4.2.1)). 

fully  loaded  system  (heavily  loaded  system):  if  the  system  under  multi¬ 
programming  of  degree  q  has  almost  always  q  eligible  Jobs,  the 
system  is  said  to  be  fully  (or  heavily)  loaded, 
global  PRA:  a  page  replacement  algorithm  for  which  the  page  for  removal 
may  be  selected  globally  from  any  area  of  primary  memory, 
local  PRA:  a  page  replacement  algorithm  for  which  the  page  for  removal 
is  selected  from  the  pages  of  the  user  process  which  necessitated 
that  page  replacement. 

LRU  PRA:  the  PRA  which  chooses  the  least-recently-used  page  for  removal. 
LFT  PRA:  the  PRA  which  chooses  the  page  with  the  longest  expected  time 
until  next  reference. 

macroscopic  paging  performance  model:  the  model  of  dynamic  paging 

performance  of  programs  based  on  the  model  of  sharing  (see  Section 
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2.5.1)  and  the  linear  paging  model  (see  Section  2.5.2). 
memory  cycle  interference:  multi-processor  interference  caused  by 

unavailability  of  a  memory  cycle  of  a  particular  primary  memory 
unit . 

memory  cycle  interference  coefficient:  see  page  142. 
memory  interference  id  ,e  time:  the  processor  time  lost  due  to  memory 
cycle  interference  when  two  or  more  processors  simultaneously 
access  the  same  unit  of  primary  memory, 
miscellaneous  overhead  time:  the  system  overhead  time  required  to 
handle  miscellaneous  faults  such  as  segment  faults,  protection 
faults,  various  non-paging  interrupts,  etc. 
miscellaneous  overhead  coefficient:  see  Eq.  (2.5.11). 
mpft:  mean  page  fetch  time,  i.e.,  the  mean  length  of  time  required  to 
fetch  a  missing  page  from  the  secondary  memory  system, 
mtbpf :  mean  time  between  page  faults,  i.e.,  the  mean  length  of  time 

during  which  program  is  executed  continuously  without  a  page  fault 
(see  Eq.  (2.5.9)  also). 

multiprogramming  idle  time:  the  processor  time  lost  when  processors 
under  a  full  load  do  not  have  executable  eligible  user  jobs, 
multiprogramming  of  degree  q:  a  multiprogramming  algorithm  which  allows 
a  maximum  of  q  processes  to  be  simultaneously  eligible, 
multi-processor  interlerence:  interference  of  multiple  processors  such 
as  memory  cycle  interference  and  data-base  lockout, 
pagination:  determination  of  a  set  of  program  blocks  to  be  contained  in 

each  page  of  a  program. 

the  collection  of  all  the  system  overhead  times 


paging  overhead  time: 
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required  to  process  a  page  fault. 

partitioned  memory:  '  memory  allocation  of  primary  memory  such  that  one 
area  of  primary  memo-y  is  solely  reserved  for  each  user  process, 
percentile  throughput:  'he  percentage  of  a  system's  computational 
capacity  utilized  for  users'  useful  work, 
percentile  response  time:  the  time  limit  whic;.  '’uarantees  that  a 

certain  proportion  (e.g.,  90  percent)  of  response  ^mes  is  shorter 
than  that  limit. 

PP  discipline:  preemptive  priority  discipline  (see  Section  3.3.3). 
(q;m,n)  configuration:  a  hardware  configuration  of  a  processing 

system  which  Involves  m  processors  and  n  primary  memory  units,  with 
q  eligible  user  processes  under  multiprogramming, 
response  time:  the  time  elasped  between  the  receipt  by  a  computer 

system  of  a  user's  specified  job  request  and  the  satisfaction  of 
that  request  at  the  terminal;  for  the  total  system  model  of  this 
thesis,  it  is  defined  to  be  the  total  length  of  time  spent  on  the 
processing  system  by  each  user  job. 
thrashing:  excessive  competition  among  user  processes  for  primary 

memory  space  leading  to  a  less  than  optimum  use  of  system  resources, 
throughput:  the  average  number  of  user  jobs  than  can  be  completed  per 


unit  time. 
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